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This review gives a pedagogical introduction to the eigenstate thermalization hypothesis (ETH), 
its basis, and its implications to statistical mechanics and thermodynamics. In the first part, ETH is 
introduced as a natural extension of ideas from quantum chaos and random matrix theory. To this 
end, we present a brief overview of classical and quantum chaos, as well as random matrix theory and 
some of its most important predictions. The latter include the statistics of energy levels, eigenstate 
components, and matrix elements of observables. Building on these, we introduce the ETH and show 
that it allows one to describe thermalization in isolated chaotic systems without invoking the notion 
of an external bath. We examine numerical evidence of eigenstate thermalization from studies of 
many-body lattice systems. We also introduce the concept of a quench as a means of taking isolated 
systems out of equilibrium, and discuss results of numerical experiments on quantum quenches. The 
second part of the review explores the implications of quantum chaos and ETH to thermodynamics. 
Basic thermodynamic relations are derived, including the second law of thermodynamics, the funda¬ 
mental thermodynamic relation, fluctuation theorems, the fluctuation-dissipation relation, and the 
Einstein and Onsager relations. In particular, it is shown that quantum chaos allows one to prove 
these relations for individual Hamiltonian eigenstates and thus extend them to arbitrary stationary 
statistical ensembles. In some cases, it is possible to extend their regimes of applicability beyond 
the standard thermal equilibrium domain. We then show how one can use these relations to obtain 
nontrivial universal energy distributions in continuously driven systems. At the end of the review, 
we briefly discuss the relaxation dynamics and description after relaxation of integrable quantum 
systems, for which ETH is violated. We present results from numerical experiments and analytical 
studies of quantum quenches at integrability. We introduce the concept of the generalized Gibbs 
ensemble, and discuss its connection with ideas of prethermalization in weakly interacting systems. 

Keywords: quantum statistical mechanics; eigenstate thermalization; quantum chaos; random 
matrix theory; quantum quench; quantum thermodynamics; generalized Gibbs ensemble. 


Verily at the first Chaos came to be, 
but next wide-bosomed Earth, 
the ever-sure foundations of all ... 

Hesiod, Theogony 
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1. Introduction 

Despite the huge success of statistical mechanics in describing the macroscopic behavior of 
physical systems mm, its relation to the underlying microscopic dynamics has remained 
a subject of debate since the foundations were laid mm- One of the most controversial 
topics has been the reconciliation of the time reversibility of most microscopic laws of 
nature and the apparent irreversibility of the laws of thermodynamics. 

Let us first consider an isolated classical system subject to some macroscopic con¬ 
straints (such as conservation of the total energy and confinement to a container). To 
derive its equilibrium properties, within statistical mechanics, one takes a fictitious en¬ 
semble of systems evolving under the same Hamiltonian and subject to the same macro¬ 
scopic constraints. Then, a probability is assigned to each member of the ensemble, and 
the macroscopic behavior of the system is computed by averaging over the fictitious en¬ 
semble [5]. For an isolated system, the ensemble is typically chosen to be the microcanon- 
ical one. To ensure that the probability of each configuration in phase space does not 
change in time under the Hamiltonian dynamics, as required by equilibrium, the ensem¬ 
ble includes, with equal probability, all configurations compatible with the macroscopic 
constraints. The correctness of the procedure used in statistical mechanics to describe 
real systems is, however, far from obvious. In actual experiments, there is generally no 
ensemble of systems - there is one system - and the relation between the calculation just 
outlined and the measurable outcome of the underlying microscopic dynamics is often 
unclear. To address this issue, two major lines of thought have been offered. 

In the first line of thought, which is found in most textbooks, one invokes the ergodic 
hypothesis [6] (refinements such as mixing are also invoked [7|). This hypothesis states 
that during its time evolution an ergodic system visits every region in phase space (sub¬ 
jected to the macroscopic constraints) and that, in the long-time limit, the time spent 
in each region is proportional to its volume. Time averages can then be said to be equal 
to ensemble averages, and the latter are the ones that are ultimately computed [6]. The 
ergodic hypothesis essentially implies that the “equal probability” assumption used to 
build the microcanonical ensemble is the necessary ingredient to capture the long-time 
average of observables. This hypothesis has been proved for a few systems, such as the 
Sinai billiard mm, the Bunimovich stadium m, and systems with more than two hard 
spheres on a d-dimensional torus {d > 2) [11] . 

Proving that there are systems that are ergodic is an important step towards having 
a mathematical foundation of statistical mechanics. However, a few words of caution are 
necessary. First, the time scales needed for a system to explore phase space are expo¬ 
nentially large in the number of degrees of freedom, that is, they are irrelevant to what 
one observes in macroscopic systems. Second, the ergodic hypothesis implies thermal- 
ization only in a weak sense. Weak refers to the fact that the ergodic hypothesis deals 
with long-time averages of observables and not with the values of the observables at long 
times. These two can be very different. Ideally, one would like to prove thermalization 
in a strong sense, namely, that instantaneous values of observables approach the equilib¬ 
rium value predicted by the microcanonical ensemble and remain close to it at almost 
all subsequent times. This is what is seen in most experiments involving macroscopic 
systems. Within the strong thermalization scenario, the instantaneous values of observ¬ 
ables are nevertheless expected to deviate, at some rare times, from their typical value. 
For a system that starts its dynamics with a non-typical value of an observable, this is, 
in fact, guaranteed by the Poincare recurrence theorem [12]. This theorem states that 
during its time evolution any finite system eventually returns arbitrarily close to the 
initial state. However, the Poincare recurrence time is exponentially long in the num¬ 
ber of degrees of freedom and is not relevant to observations in macroscopic systems. 
Moreover, such recurrences are not at odds with statistical mechanics, which allows for 
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atypical configurations to occur with exponentially small probabilities. We should stress 
that, while the ergodic hypothesis is expected to hold for most interacting systems, there 
are notable exceptions, particularly in low dimensions. For example, in one dimension, 
there are many known examples of (integrable or near integrabl^ systems that do not 
thermalize, not even in the weak sense m- A famous example is the Fermi-Pasta-Ulam 
numerical experiment in a chain of anharmonic oscillators, for which the most recent 
results show no (or extremely slow) thermalization |14[ I15j . This problem had a major 
impact in the field of nonlinear physics and classical chaos (see, e.g., Ref. HU). Relax¬ 
ation towards equilibrium can also be extremely slow in turbulent systems m and in 
glassy systems [T8] . 

In the second, perhaps more appealing, line of thought one notes that macroscopic 
observables essentially exhibit the same values in almost all configurations in phase space 
that are compatible with a given set of macroscopic constraints. In other words, almost 
all the configurations are equivalent from the point of view of macroscopic observables. 
For example, the number of configurations in which the particles are divided equally (up 
to non-extensive corrections) between two halves of a container is exponentially larger 
than configurations in which this is not the case. Noting that “typical” configurations 
vastly outnumber “atypical” ones and that, under chaotic dynamics each configuration 
is reached with equal probability, it follows that “atypical” configurations quickly evolve 
into “typical” ones, which almost never evolve back into “atypical” configurations. Within 
this line of thought, thermalization boils down to reaching a “typical” configuration. This 
happens much faster than any relevant exploration of phase space required by ergodicity. 
Note that this approach only applies when the measured quantity is macroscopic (such 
as the particle number mentioned in the example considered above). If one asks for 
the probability of being in a specific microscopic configuration, there is no meaning in 
separating “typical” from “atypical” configurations and the predictive power of this line 
of reasoning is lost. As appealing as this line of reasoning is, it lacks rigorous support. 

Taking the second point of view, it is worth noting that while most configurations 
in phase space are “typical”, such configurations are difficult to create using external 
perturbations. For example, imagine a piston is moved to compress air in a container. If 
the piston is not moved slowly enough, the gas inside the container will not have time 
to equilibrate and, as a result, during the piston’s motion (and right after the piston 
stops) its density will not be uniform, that is, the system is not in a “typical” state. In a 
similar fashion, by applying a radiation pulse to a system, one will typically excite some 
degrees of freedom resonantly, for example, phonons directly coupled to the radiation. As 
a result, right after the pulse ends, the system is in an atypical state. Considering a wide 
range of experimental protocols, one can actually convince oneself that it is generally 
difficult to create “typical” configurations if one does not follow a very slow protocol or 
without letting the system evolve by itself. 

At this point, a comment about time-reversal symmetry is in order. While the micro¬ 
scopic laws of physics usually exhibit time-reversal symmetry, notable exceptions include 
systems with external magnetic fields, the resulting macroscopic equations used to de¬ 
scribe thermodynamic systems do not exhibit such a symmetry. This can be justified 
using the second line of reasoning - it is exponentially rare for a system to evolve into 
an “atypical” state by itself. Numerical experiments have been done (using integer arith¬ 
metic) in which a system was started in an atypical configuration, was left to evolve, 
and, after some time, the velocities of all particles were reversed. In those experiments, 
the system was seen to return to the initial (atypical) configuration |19j . Two essential 
points to be highlighted from these numerical simulations are: (i) after reaching the ini- 


^We briefly discuss classical integrability and chaos in Sec. and quantum integrability in Sec.j^ 
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tial configuration, the system continued its evolution towards typical configurations (as 
expected, in a time-symmetric fashion) and (ii) the time-reversal transformation needed 
to be carried out with exquisite accuracy to observe a return to the initial (atypical) con¬ 
figuration (hence, the need of integer arithmetic). The difficulty in achieving the return 
increases dramatically with increasing system size and with the time one waits before 
applying the time-reversal transformation. It is now well understood, in the context of 
fluctuation theorems |20l 121] . that violations of the second law (i.e., evolution from typi¬ 
cal to atypical conhgurations) can occur with a probability that decreases exponentially 
with the number of degrees of freedom in the system. These have been confirmed exper¬ 
imentally (see, e.g.. Ref. [22]). As part of this review, we derive fluctuation theorems in 
the context of quantum mechanics |23H25j . 

Remarkably, a recent breakthrough |26II28j has put the understanding of thermaliza- 
tion in quantum systems on more solid foundations than the one discussed so far for 
classical systems. This breakthrough falls under the title of the eigenstate thermalization 
hypothesis (ETH). This hypothesis can be formulated as a mathematical ansatz with 
strong predictive powers |29j . ETH and its implications for statistical mechanics and 
thermodynamics are the subject of the review. As we discuss, ETH combines ideas of 
chaos and typical conhgurations in a clear mathematical form that is unparalleled in clas¬ 
sical systems. This is remarkable considering that, in some sense, the relation between 
microscopic dynamics and statistical mechanics is more subtle in quantum mechanics 
than in classical mechanics. In fact, in quantum mechanics one usually does not use 
the notion of phase space as one cannot measure the positions and momenta of par¬ 
ticles simultaneously. The equation dictating the dynamics (Schrodinger’s equation) is 
linear which implies that the key ingredient leading to chaos in classical systems, that is, 
nonlinear equations of motion, is absent in quantum systems. 

As already noted by von Neumann in 1929, when discussing thermalization in isolated 
quantum systems one should focus on physical observables as opposed to wave functions 
or density matrices describing the entire system m- This approach is similar to the 
one described above for classical systems, in which the focus is put on macroscopic ob¬ 
servables and “typical” configurations. In this spirit, ETH states that the eigenstates of 
generic quantum Hamiltonians are “typical” in the sense that the statistical properties 
of physical observable^ are the same as those predicted by the microcanonical ensemble. 
As we will discuss, ETH implies that the expectation values of such observables as well 
as their fluctuations in isolated quantum systems far from equilibrium relax to (nearly) 
time-independent results that can be described using traditional statistical mechanics 
ensembles |26It28j . This has been verihed in several quantum lattice systems and, ac¬ 
cording to ETH, should occur in generic many-body quantum systems. We also discuss 
how ETH is related to quantum chaos in many-body systems, a subject pioneered by 
Wigner in the context of Nuclear Physics m- Eurthermore, we argue that one can build 
on ETH not only to understand the emergence of a statistical mechanics description 
in isolated quantum systems, but also to derive basic thermodynamic relations, linear 
response relations, and fluctuation theorems. 

When thinking about the topics discussed in this review some may complain about the 
fact that, unless the entire universe is considered, there is no such thing as an isolated 
system, i.e., that any description of a system of interest should involve a bath of some 
sort. While this observation is, strictly speaking, correct, it is sometimes experimentally 
irrelevant. The time scales dictating internal equilibration in “well-isolated” systems can 
be much faster than the time scales introduced by the coupling to the “outside world”. 
It then makes sense to question whether, in experiments with well-isolated systems, ob- 


^We will explain what we mean by physical observables when discussing the ETH ansatz. 
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servables can be described using statistical mechanics on time scales much shorter than 
those introduced by the coupling to the outside world. This question is of relevance to 
current experiments with a wide variety of systems. For example, in ultracold quantum 
gases that are trapped in ultrahigh vacuum by means of (up to a good approximation) 
conservative potentials j32l I33j . The near unitary dynamics of such systems has been 
observed in beautiful experiments on collapse and revival phenomena of bosonic j34H36] 
and fermionic m fields, lack of relaxation to the predictions of traditional ensembles 
of statistical mechanics [38til0] . and dynamics in optical lattices that were found to be 
in very good agreement with numerical predictions for unitary dynamics [JT]. In optical 
lattice experiments, the energy conservation constraint, imposed by the fact that the sys¬ 
tem are “isolated”, has also allowed the observation of counterintuitive phenomena such 
as the formation of stable repulsive bound atom pairs |l2] and quantum distillation |43] 
in ultracold bosonic systems. Other examples of nearly isolated systems include nuclear 
spins in diamond |44j . pump-probe experiments in correlated materials in which dynam¬ 
ics of electrons and holes are probed on time scales much faster than the relaxation time 
associated with electron-phonon interactions |45l 06] , and ensembles of ultra-relativistic 
particles generated in high-energy collisions 071. 

This review can naturally be separated in two parts, and an addendum. In the hrst 
part. Secs. we briefly introduce the concept of quantum chaos, discuss its relation 
to random matrix theory (RMT), and calculate its implications to observables. We then 
introduce ETH, which is a natural extension of RMT, and discuss its implications to 
thermalization in isolated systems, that is, relaxation of observables to the thermal equi¬ 
librium predictions. We illustrate these ideas with multiple numerical examples. In the 
second part. Secs. 113 we extend our discussion of the implications of quantum chaos and 
ETH to dynamical processes. We show how one can use quantum chaos and ETH to derive 
various thermodynamic relations (such as fluctuation theorems, fluctuation-dissipation 
relations, Onsager relations, and Einstein relations), determine leading hnite-size correc¬ 
tions to those relations, and, in some cases, generalize them (e.g., the Onsager relation) 
beyond equilibrium. Einally, in the addendum (Sec.[^, we discuss the relaxation dynam¬ 
ics and description after relaxation of integrable systems after a quench. We introduce 
the generalized Gibbs ensemble (GGE), and, using time-dependent perturbation theory, 
show how it can be used to derive kinetic equations. We note that some of these topics 
have been discussed in other recent reviews |48H53j and special journal issues IMlESj. 


2. Chaos and Random Matrix Theory (RMT) 

2.1. Classical Chaos 

In this section, we very briefly discuss chaotic dynamics in classical systems. We refer the 
readers to Refs. 06] ET], and the literature therein, for further information about this 
topic. As the focus of this review is on quantum chaos and the eigenstate thermalization 
hypothesis, we will not attempt to bridge classical chaos and thermalization. This has 
been a subject of continuous controversies. 

While there is no universally accepted rigorous dehnition of chaos, a system is usually 
considered chaotic if it exhibits a strong (exponential) sensitivity of phase-space trajec¬ 
tories to small perturbations. Although chaotic dynamics are generic, there is a class of 
systems for which dynamics are not chaotic. They are known as integrable systems [58] . 
Specihcally, a classical system whose Hamiltonian is R(p, q), with canonical coordinates 
H = ( 9 I) ■ • • ) Qn) and momenta p = (pi, • • • ,PAr), is said to be integrable if it has as many 
functionally independent conserved quantities I = (Ii, • • • , /at) in involution as degrees 


6 


August 2, 2016 


Advances in Physics Review 



Figure 1. Examples of trajectories of a particle bouncing in a cavity: (a) non-chaotic circular and (b) chaotic 
Bunimovich stadium. The images were taken from scholarpedia m- 


of freedom N: 


{Ij,H} = 0, {Ij,Ik} = 0, where 


If V ^9 df dg 

dpjdqj’ 


( 1 ) 


From Liouville’s integrability theorem [59], it follows that there is a canonical trans¬ 
formation [p, q) —)■ (1,0) (where 1 ,0 are called action-angle variables) such that 
H{p, q) = H{I) (58]. As a result, the solutions of the equations of motion for the action- 
angle variables are trivial: Ij{t) = I j = constant, and Qj{t) = -|- 0j(O). For obvious 

reasons, the motion is referred to as taking place on an V-dimensional torus, and it is 
not chaotic. 

To get a feeling for the differences between integrable and chaotic systems, in Fig. 
we illustrate the motion of a particle in both an integrable and a chaotic two-dimensional 
cavity [60]. Figure]^ a) illustrates the trajectory of a particle in an integrable circular 
cavity. It is visually apparent that the trajectory is a superposition of two periodic mo¬ 
tions along the radial and angular directions. This is a result of the system having two 
conserved quantities, energy and angular momentum m- Clearly, the long-time aver¬ 
age of the particle density does not correspond to a uniform probability which covers 
phase space. Figure [^b), on the other hand, shows a trajectory of a particle in a chaotic 
Bunimovich stadium [TO], which looks completely random. If one compares two trajec¬ 
tories that are initially very close to each other in phase space one finds that, after a 
few bounces against the walls, they become uncorrelated both in terms of positions and 
directions of motion. This is a consequence of chaotic dynamics. 

There are many examples of dynamical systems that exhibit chaotic behavior. A nec¬ 
essary, and often sufficient, condition for chaotic motion to occur is that the number of 
functionally independent conserved quantities (integrals of motion), which are in involu¬ 
tion, is smaller than the number of degrees of freedom. Otherwise, as mentioned before, 
the system is integrable and the dynamics is “simple”. This criterion immediately tells us 
that the motion of one particle, without internal degrees of freedom, in a one-dimensional 
system, described by a static Hamiltonian, is integrable. The energy provides a unique 
(up to a sign) relation between the coordinate and the momentum of the particle. In two 
dimensions, energy conservation is not sufficient to constrain the two components of the 
momentum at a given position in space, and chaos is possible. However, if an additional 
conservation law is present, e.g., angular momentum in the example of Fig. [^a), then 
the motion is regular. As a generalization of the above, a many-particle system is usu¬ 
ally considered chaotic if it does not have an extensive number of conserved quantities. 
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For example, an ensemble of noninteracting particles in high-dimensional systems is not 
chaotic in this sense, even if each particle exhibits chaotic motion in the part of phase 
space associated with its own degrees of freedom. This due to the fact that the energy 
of each particle is separately conserved. However, one expects that interactions between 
the particles will lead to chaotic motion. 

It is natural to ask what happens to an integrable system in the presence of a small 
integrability breaking perturbation. The KAM theorem (after Kolmogorov, Arnold, and 
Moser |62H64j 1 states that, under quite general conditions and for systems with a finite 
number of degrees of freedom, most of the tori that foliate phase space in the integrable 
limit persist under small perturbations [59]. This means that, in finite systems, there is 
a crossover between regular and chaotic dynamics. 

It is instructive to see how chaos emerges in simple system. The easiest way to do 
this is to study one particle in one dimension and remove the energy conservation by 
applying a time-dependent protocol. Very well-studied examples of such driven systems 
(usually exhibiting a coexistence of chaotic and regular motion in different parts of phase 
space) include the Fermi-Ulam model [65], the Kapitza pendulum [66]) and the kicked 
rotor (67] [68]. The latter example provides, perhaps, the simplest realization of a chaotic 
system. As an illustration, we discuss it in detail in Appendix [A] 


2.2. Random Matrix Theory 

A focus of this review is on eigenstate thermalization which, as we argue in the following, 
is closely related to quantum chaos (see, e.g., Refs. IMIEO], for numerical studies that 
discuss it). In this section, we review results from quantum chaos that will be needed 
later. We refer the readers to more complete reviews on quantum chaos and RMT for 
further details 

From the early days of quantum mechanics, it was clear that the classical notion 
of chaos does not directly apply to quantum-mechanical systems. The main reason is 
that Schrodinger’s equation is linear and therefore cannot have exponentially departing 
trajectories for the wave functions. As a matter of fact, the overlap between two different 
quantum states, evolved with the same Hamiltonian, is constant in time. Also, while 
quantum mechanics can be formulated in a phase-space language, for example, using the 
Wigner-Weyl quantization [751116], one still does not have the notion of a trajectory (and 
thus its sensitivity to small perturbations) since coordinates and momenta of particles 
cannot be defined simultaneously due to the uncertainty principle. It is then natural to 
ask what is the analogue of chaotic motion in quantum systems. 

To better understand this question, let us first consider the single-particle classical 
limit. For integrable systems, the physics was understood in the early days of quan¬ 
tum mechanics, based on Bohr’s initial insight. Along allowed trajectories, the classical 
reduced action satisfies the quantization condition: 

pdq Ri Inhn . (2) 

Namely, the classical action is quantized in units of h. In 1926, this conjecture was 
formalized by what is now known as the WKB (after Wentzel, Kramers, and Brillouin) 
approximation m- Essentially, the WKB quantization implies that, in the semi-classical 
limit, one has to discretize (quantize) classical trajectories. In chaotic systems, the situa¬ 
tion remained unclear for a very long time. In particular, it was not clear how to quantize 
classical chaotic trajectories, which are not closed (in phase space). Initial attempts to 
resolve these issues go back to Einstein who wrote a paper about them already in 1917 
(see Ref. [78] for details). However, the question was largely ignored until the 1970s when. 
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after a pioneering work by Gutzwiller m. it became the focus of much research broadly 
falling under the title of quantum chaos. To this day many questions remain unresolved, 
including the precise definition of quantum chaos [50] . 

A set of crucial results on which quantum chaos builds came from works of Wigner ini 
IHTl 152] who, followed by Dyson [HH] and others, developed a theory for understanding 
the spectra of complex atomic nuclei. This theory is now known as RMT [73] • RMT 
became one of the cornerstones of modern physics and, as we explain later, underlies 
our understanding of eigenstate thermalization. Wigner’s original idea was that it is 
hopeless to try to predict the exact energy levels and corresponding eigenstates of complex 
quantum-mechanical systems such as large nuclei. Instead, one should focus on their 
statistical properties. His second insight was that, if one looks into a small energy window 
where the density of states is constant, then the Hamiltonian, in a non fine-tuned basis, 
will look essentially like a random matrix. Therefore, by studying statistical properties 
of random matrices (subject to the symmetries of the Hamiltonian of interest, such as 
time-reversal symmetry), one can gain insights on the statistical properties of energy 
levels and eigenstates of complex systems. This latter insight was very revolutionary 
and counterintuitive. It should be noted that whenever we attempt to diagonalize many- 
body physical Hamiltonians, we usually write them in special bases in which the resulting 
matrices are very sparse and the nonzero matrix elements are anything but random. This, 
however, does not contradict Wigner’s idea which deals with “generic” bases. 

The main ideas of RMT and the statistics of the energy levels (known as Wigner-Dyson 
statistics) can be understood using 2x2 Hamiltonians whose entries are random numbers 
taken from a Gaussian distribution [7IH7i|: 


H = 


£l 

V2 


_V_ 

x/2 

£2 


(3) 


Here the factor l/\/2 in the off-diagonal matrix elements is introduced since, as it will 
become clear soon, this choice leaves the form of the Hamiltonian invariant under basis 
rotations. The Hamiltonian in Eq. Q can be easily diagonalized and the eigenvalues are 

^^i ,2 = ^4^±2^(ei-e2)2 + 2|V|2. (4) 

If the system is invariant under time reversal (e.g., there is no external magnetic field) 
then the Hamiltonian can be written as a real matrix, so R = V*. For simplicity, we 
draw ei, e 2 , and V from a Gaussian distribution with zero mean and variance a. Using 
Eq. @ one can compute the statistics of the level separations P{Ei — E 2 = a;) = 

(here and in what follows, unless otherwise specified, we set h to unity): 

= (2^)3/2^3 / / ^^2 I dV 5 (V(ei-62)2 + 21/2 _ gxp . 

(5) 

Before evaluating the integral over ei, we make a change of variables 62 = 61 + V2^. 
Then, integrating over 61 , which is a Gaussian integral, we are left with 

^(w) = 2 ^/ I d^dV6(^,/2e + 2V^-u;)exp . ( 6 ) 

The latter integrals can be evaluated using cylindrical coordinates, V = rcos(a:), ^ = 
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rsin(x), and one finds: 


P{u;) 


OJ 

2^2 


exp 



(7) 


In the absence of time-reversal symmetry, 3f?[V] and can be treated as independent 
random variables and, carrying out a similar calculation using spherical coordinates, 
leads to: 


P{u) 




exp 



( 8 ) 


These distributions exhibit some remarkable (generic) properties: (i) there is level 
repulsion since the probability P{u)) of having energy separation oj vanishes as w —>• 
0 and (ii) the probability decays as a Gaussian at large energy separation. The two 
distributions 0 and Q can be written as 

P{oj) = ex.\i[—BpuP‘], (9) 


where /3 = 1 in systems with time-reversal symmetry and /3 = 2 in systems that do not 
have time-reversal symmetry. The coefficients Ap and Bp are found by normalizing P{uj) 
and fixing the mean level spacing. The normalized distributions, with an average level 
spacing set to one, are given by 


TT 


Pi{^) = 


TT 2 


32 

T’ 2 (w) = exp 


TT 


- UJ 

TT 


( 10 ) 


It turns out that the features described above are not unique to the 2x2 Hamilto¬ 
nian (j^. In fact, this simple example can be generalized to larger matrices. In particular, 
one can define an ensemble of matrices drawn from a random Gaussian distribution [72]: 


P{H) oc exp 



= exp 

2a2 ^ 



d 


( 11 ) 


where a sets the overall energy scale and, as before, /3 = 1 refers to systems with time- 
reversal symmetry where all entries in the Hamiltonian are real and satisfy Plij = Hji, 
that is, the so-called Gaussian orthogonal ensemble (GOE), and (3 = 2 refers to systems 
without time-reversal symmetry, where the entities are complex and satisfy Hij = P[*^, 
that is, the so-called Gaussian unitary ensemble (GUE)j^Note that the factor of \/2 in 
Eq. Q ensures that the Hamiltonian is described by the distribution ( |11[ ). 

The choice of the ensemble in Eq. (0 is a natural one. The ensemble must be invari¬ 
ant under any orthogonal (GOE) or unitary (GUE) transformation, so the probability 
distribution can only depend on the invariant It is Gaussian because is 

a sum of many independent contributions and should therefore satisfy the central limit 
theorem. We will not discuss the details of the derivations of the level statistics for such 
random ensembles, which can be found in Refs. [ZIHTlllHlj. We only point out that the 
exact level spacing distributions (known as Wigner-Dyson distributions) do not have a 


®There is a third ensemble, corresponding to /3 = 4, known as the Gaussian simplectic ensemble (GSE). We will 
not discuss here. 
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closed analytic form. However, they are qualitatively (and quantitatively) close to the 
Wigner Surmise (j^. 

Following Wigner’s ideas, it was possible to explain the statistical properties of the 
spectra of complex nuclei. However, for a long time it was not clear which are the “com¬ 
plex systems” for which RMT is generally applicable. In 1984, Bohigas, Giannoni, and 
Schmit, studying a single particle placed in an infinite potential well with the shape of a 
Sinai billiard, found that at high energies (i.e., in the semi-classical limit), and provided 
that one looks at a sufficiently narrow energy window, the level statistics is described 
by the Wigner-Dyson distribution [85]. Based on this discovery, they conjecture that the 
level statistics of quantum systems that have a classically chaotic counterpart are de¬ 
scribed by RMT (this is known as the BGS conjecture). This conjecture has been tested 
and confirmed in many different setups (we will show some of them in the next section). 
To date, only non-generic counterexamples, such as arithmetic billiards, are known to 
violate this conjecture |86|. Therefore, the emergence of Wigner-Dyson statistics for the 
level spacings is often considered as a defining property of quantum chaotic systems, 
whether such systems have a classical counterpart or not. 

2.2.1. Chaotic Eigenfunctions 

RMT allows one to make an important statement about the eigenvectors of random 
matrices. The joint probability distribution of components of eigenvectors can be written 
as [721 [87] 


-Pgoe(V'i, V’2, ■ ■ •, i’N) oc XI V’l 


-1 


-Pgue(V'i, V’2, ■ ■ ■ , iPn) OC 6 


- 1 


( 12 ) 

where 'ijjj are the components of the wave functions in some hxed basis. This form follows 
from the fact that, because of the orthogonal (unitary) invariance of the random matrix 

ensemble, the distribution can depend only on the norm of the 

eigenvector, and must be proportional to the ^-functions in Eq. (12) because of the 
normalization m Essentially, Eq. ( |12[ ) states that the eigenvectors of random matrices 
are random unit vectors, which are either real (in the GOE) or complex (in the GUE). 
Of course, different eigenvectors are not completely independent since they need to be 
orthogonal to each other. However, because two uncorrelated random vectors in a large¬ 
dimensional space are, in any case, nearly orthogonal, in many instances the correlations 
due to this orthogonality condition can be ignored. 

One may wonder about the classical limit of quantum eigenvectors. The latter are 
stationary states of the system and should therefore correspond to stationary (time- 
averaged) trajectories in the classical limit. In integrable systems with a classical limit, 
the quantum eigenstates factorize into a product of WKB-like states describing the sta¬ 
tionary phase-space probability distribution of a particle corresponding to one of the 
trajectories m- However, if the system is chaotic, the classical limit of the quantum 
eigenstates is ill-defined. In particular, in the classical limit, there is no smooth (dif¬ 
ferentiable) analytic function that can describe the eigenstates of chaotic systems. This 
conclusion follows from the BGS conjecture, which implies that the eigenstates of a 
chaotic Hamiltonian in non-fine-tuned bases, including the real space basis, are essen¬ 
tially random vectors with no structure. 

Let us address a point that often generates confusion. Any given (Hermitian) Hamil¬ 
tonian, whether it is drawn from a random matrix ensemble or not, can be diagonalized 
and its eigenvectors form a basis. In this basis, the Hamiltonian is diagonal and RMT 
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specifies the statistics of the eigenvalues. The statistical properties of the eigenstates are 
specified for an ensemble of random Hamiltonians in a fixed basis. If we fix the basis 
to be that of the eigenkets of the first random Hamiltonian we diagonalize, that basis 
will not be special for other randomly drawn Hamiltonians. Therefore, all statements 
made will hold for the ensemble even if they fail for one of the Hamiltonians. The issue 
of the basis becomes more subtle when one deals with physical Hamiltonians. Here, one 
can ask what happens if we diagonalize a physical Hamiltonian and use the eigenvectors 
obtained as a basis to write a slightly modified version of the same Hamiltonian (which 
is obtained, say, by slightly changing the strength of the interactions between particles). 
As we discuss below (see also Ref. m), especially in the context of many-body systems, 
the eigenstates of chaotic quantum Hamiltonians [which are away from the edge(s) of 
the spectrum]^ are very sensitive to small perturbations. Hence, one expects that the 
perturbed Hamiltonian will look like a random matrix when written in the unperturbed 
basis. In that sense, writing a Hamiltonian in its own basis can be considered to be a 
fine-tuning of the basis. It is in this spirit that one should take Wigner’s insight. The 
sensitivity just mentioned in chaotic quantum systems is very similar to the sensitivity of 
classical chaotic trajectories to either initial conditions or the details of the Hamiltonian. 

2.2.2. The Structure of the Matrix Elements of Operators 

Let us now analyze the structure of matrix elements of Hermitian operators 

O = Oj|i)(i|, where 0\i)=0i\i), (13) 

i 


within RMT. For any given random Hamiltonian, for which the eigenkets are denoted 
by \m) and |n). 


Omn = (m|d|n) = 'Y^Oi{m\i){i\n) = . 

i i 


(14) 


Here, V’l” = {i\m) and similarly for iff. Recall that the eigenstates of random matrices 
in any basis are essentially random orthogonal unit vectors. Therefore, to leading order 
in l/P, where T> is the dimension of the Hilbert space, we have 


1 




(15) 


where the average (V’i”)*('0^) is over random eigenkets |m) and |n). This implies that one 
has very different expectation values for the diagonal and off-diagonal matrix elements 
of O. Indeed, using Eqs. and ( [T^ , we find 


0mm — 




= 0 , 


(16) 


and 


Omn = 0, for m^n. 


(17) 


^That one needs to be away from the edges of the spectrum can already be inferred from the fact that BGS found 
that the Wigner-Dyson distribution occurs only at sufficiently high energies m- We will discuss this point in 
detail when presenting results for many-body quantum systems. 
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Moreover, the fluctuations of the diagonal and off-diagonal matrix elements are sup¬ 
pressed by the size of the Hilbert space. For the diagonal matrix elements 

^ 

ij i,j 

= E o? {wr* - (Wf)") = ^ E o.— Sv 

i i 

where, as before, /? = 1 for the GOE and (3 = 2 for the GUE. Eor the GOE are real 
numbers), we used the relation while for the GUE are complex 

numbers), we used the relation = 2(|V’)^P)^- These results are a direct consequence 

of the Gaussian distribution of the components of the random vector Assuming that 
none of the eigenvalues Oi scales with the size of the Hilbert space, as is the case for 
physical observables, we see that the fluctuations of the diagonal matrix elements of O 
are inversely proportional to the square root of the size of the Hilbert space. 

Likewise, for the absolute value of the off-diagonal matrix elements, we have 

[dj - (19) 

i 

Gombining these expressions, we see that, to leading order in 1/D, the matrix elements 
of any operator can be written as 


Or. 



( 20 ) 


where Rmn is a random variable (which is real for the GOE and complex for the GUE) 
with zero mean and unit variance (for the GOE, the variance of the diagonal components 
Rmm is 2). It is straightforward to check that the ansatz (20) indeed correctly reproduces 
the mean and the variance of the matrix elements of O given by Eqs. (16)-(19). 


In deriving Eqs. (16)-(19), we averaged over a fictitious ensemble of random Hamilto¬ 


nians. However, from Eq. (20), it is clear that for large D the fluctuations of operators 


are small and thus one can use the ansatz (20) for a given fixed Hamiltonian. 


2.3. Berry-Tabor Conjecture 

In classical systems, an indicator of whether they are integrable or chaotic is the tem¬ 
poral behavior of nearby trajectories. In quantum systems, the role of such an indicator 
is played by the energy level statistics. In particular, for chaotic systems, as we dis¬ 
cussed before, the energy levels follow the Wigner-Dyson distribution. For quantum in¬ 
tegrable systems, the question of level statistics was first addressed by Berry and Tabor 
in 1977 [88] . For a particle in one dimension, which we already said exhibits non-chaotic 
classical dynamics if the Hamiltonian is time independent, we know that if we place it in 
a harmonic potential all levels are equidistant, while if we place it in an infinite well the 
spacing between levels increases as the energy of the levels increases. Hence, the statistics 
of the level spacings strongly depends on the details of the potential considered. This 
is unique to one particle in one dimension. It changes if one considers systems whose 
classical counterparts have more than one degree of freedom, for example, one particle in 
higher dimensional potentials, or many particles in one dimension. A very simple example 


13 





























August 2, 2016 


Advances in Physics Review 


of a non-ergodic system, with many degrees of freedom, would be an array of indepen¬ 
dent harmonic oscillators with incommensurate frequencies. These can be, for example, 
the normal modes in a harmonic chain. Because these oscillators can be diagonalized 
independently, the many-body energy levels of such a system can be computed as 

^ = ( 21 ) 

3 

where nj are the occupation numbers and uij are the mode frequencies. If we look into 
high energies, E, when the occupation numbers are large, nearby energy levels (which 
are very closely spaced) can come from very different sets of {nj}. This means that the 
energy levels E are effectively uncorrelated with each other and can be treated as random 
numbers. Their distribution then should be described by Poisson statistics, that is, the 
probability of having n energy levels in a particular interval [E, E + 5E] will be 

\n 

P„ = —exp[-A], (22) 

where A is the average number of levels in that interval. Poisson and Wigner-Dyson statis¬ 
tics are very different in that, in the former there is no level repulsion. For systems with 
Poisson statistics, the distribution of energy level separations to (with mean separation 
set to one) is 


Poiuj) = exp[-a;]. 


(23) 


which is very different from the Wigner Surmise (10). The statement that, for quantum 


systems whose corresponding classical counterpart is integrable, the energy eigenvalues 
generically behave like a sequence of independent random variables, that is, exhibit Pois¬ 
son statistics, is now known as the Berry-Tabor conjecture [88]. While this conjecture 
describes what is seen in many quantum systems whose classical counterpart is integrable, 
and integrable quantum systems without a classical counterpart, there are examples for 
which it fails (such as the single particle in the harmonic potential described above and 
other harmonic systems [89] 1. Deviations from Poisson statistics are usually the result 
of having symmetries in the Hamiltonian that lead to extra degeneracies resulting in 
commensurability of the spectra. 

The ideas discussed above are now regularly used when dealing with many-particle 
systems. The statistics of the energy levels of many-body Hamiltonians serves as one of 
the main indicators of quantum chaos or, conversely, of quantum integrability. As the 
energy levels become denser, the level statistics asymptotically approaches either the 
Wigner-Dyson or the Poisson distribution. It is interesting to note that in few-particle 
systems, like a particle in a billiard, the spectra become denser by going to the semi- 
classical limit by either increasing the energy or decreasing Planck’s constant, while in 
many-particle systems one can achieve this by going to the thermodynamic limit. This 
means that the level statistics indicators can be used to characterize whether a quantum 
system is chaotic or not even when it does not have a classical limit. This is the case, 
for example, for lattice systems consisting of spins 1/2 or interacting fermions described 
within the one-band approximation. 

Finally, as we discuss later, the applicability of RMT requires that the energy levels 
analyzed are far from the edges of the spectrum and that the density of states as a 
function of energy is accounted for. The first implies that one needs to exclude, say, the 
ground state and low-lying excited states and the states with the highest energies (if 
the spectrum is bounded from above). It is plausible that in generic systems only states 
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within subextensive energy windows near the edges of the spectrum are not described 
by RMT. 


2.4. The Semi-Classical Limit and Berry’s Conjecture 

One of the most remarkable connections between the structure of the eigenstates 
of chaotic systems in the semi-classical limit and classical chaos was formulated by 
Berry [90], and is currently known as Berry’s conjecture. (In this section, we return 
h to our equations since it will be important for taking the classical limit h —>• 0.) 

In order to discuss Berry’s conjecture, we need to introduce the Wigner function 
kk(x, p), which is defined as the Wigner-Weyl transform of the density matrix p |751I76| . 
For a pure state, p = |'!/;)('i/’|, one has 


iy(x,p) = 


( x + ^ 1 V' ( X - ^ 1 exp 


(27rh) 


3Af 


\ 

r .P^i 

1 exp 

_ * h _ 


(24) 


where x, p are the coordinates and momenta of V-particles spanning a 6iV-dimensional 
phase space. For a mixed state one replaces the product 


r 




p 




(25) 


where p is the density matrix. One can check that, similarly, the Wigner function can be 
defined by integrating over momentum 


W(x,p) 


1 

{2Trh)^^ 



P-^ 
^ 2 


exp 


xr; 

*irj’ 


(26) 


where <^(p) is the Fourier transform of V’(x). From either of the two representations it 
immediately follows that 


d^^pW{x,p) = and / IF(x,p) = |(;/i(p)| 


'3N„ 


(27) 


The Wigner function is uniquely dehned for any wave function (or density matrix) and 
plays the role of a quasi-probability distribution in phase space. In particular, it allows 
one to compute an expectation value of any observable O as a standard average [751 [76|: 

(O) = I d^^xd^^pOw{^,p)W{^,p), (28) 


where Ovv(x, p) is the Weyl symbol of the operator O 


Oiy(x,p) 


1 

{27Th)^^ 



2 




■ P^' 

0 

X + - )exp 

_ * h _ 


(29) 


We note that instead of the coordinate and momentum phase-space variables one can, 
for example, use coherent state variables to represent electromagnetic or matter waves, 
angular momentum variables to represent spin systems, or any other set of canonically 
conjugate variables m- 
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Berry’s conjecture postulates that, in the semi-classical limit of a quantum system 
whose classical counterpart is chaotic, the Wigner function of energy eigenstates (aver¬ 
aged over a vanishingly small phase space) reduces to the microcanonical distribution. 
More precisely, dehne 


W(X,P) 



dxidpi 

(27rh) 



dx^dpN 

(27rh) 


lV(x,p), 


(30) 


where AOj is a small phase-space volume centered around the point Xj, Pj. This volume 
is chosen such that, as h —)• 0, AUj —)• 0 and at the same time h/Ail,j —?■ 0. Berry’s 
conjecture then states that, as h —)• 0, 


IT(X,P) 


fd^^Xd^^P6[E-H(X,P)f^^ ^(X,P)], 


(31) 


where H(X.,P) is the classical Hamiltonian describing the system, and 5[...] is a one¬ 
dimensional Dirac delta function. In Berry’s words, “V'(x) is a Gaussian random function 
of X whose spectrum at x is simply the local average of the Wigner function IT(x, p)”. 
Berry also considered quantum systems whose classical counterpart is integrable. He 
conjectured that the structure of the energy eigenstates of such systems is very different 


m- 

It follows from Berry’s conjecture, see Eqs. (28) and (31), that the energy eigenstate 
expectation value of any observable in the semi-classical limit (of a quantum system 
whose classical counterpart is chaotic) is the same as a microcanonical average. 

For a dilute gas of hard spheres, this was studied by Srednicki in 1994 [27]. Let us 
analyze the latter example in detail. Srednicki argued that the eigenstate corresponding 
to a high-energy eigenvalue En can chosen to be real and written as 


V^n(x) =Afn 


d^^pAn{p)6{p‘^ - 2mEn) exp[ipx/h], 


(32) 


where Mn is a normalization constant, and ^5!j(p) = H„(—p). In other words the energy 
eigenstates with energy En are given by a superposition of plane-waves with momentum 
p such that En = p^/(2m). Assuming Berry’s conjecture applies, A„(p) was taken to be 
a Gaussian random variable satisfying 


/ /l ^ ^ 4 ^ iT 6^^{p + p) 

{Am{p)An{p ))ee = ^mnYTTX) -TVm ' (33) 

<i(|p| - Ip r) 

Here the average should be understood as over a fictitious “eigenstate ensemble” of 
energy eigenstates of the system indicated by “EE”. This replaces the average over a 
small phase-space volume used by Berry. The denominator in this expression is needed 
for proper normalization. 

From these assumptions, it follows that 

{(Kn{p)(t>n{p'))EE = ^mn A'n 5{p^ - 2mEn)5^ {p - p'), (34) 


where <?in(p) is the 3V-dimensional Fourier transform of 'i/’n(x), and (5\/(p) = 

(27rh)“^^ jy exp[ipx/h] and V = is the volume of the system. It is straight¬ 
forward to check, using Eqs. (34) and (26), that the Wigner function averaged over the 
eigenstate ensemble is indeed equivalent to the microcanonical distribution [Eq. (HI)]. 
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Using Eqs. (32)-(34) one can compute observables of interest in the eigenstates of 
the Hamiltonian. For example, substituting <5v"(0) —)• [L/(27r^)]^'^, one can calculate the 
momentum distribution function of particles in the eigenstate ensemble 


{(t)nn{Vl))EE^ J d^P2 ■ ■ ■ d^PN {<Pn{p)4’n{p)) EE = J d^P2 ■ ■ ■ d^PN^ip^-2mEn). 

(35) 

Finally, using the fact that In{A) = f d^p6{p‘^ — A) = (7r^)^/^/[r(V/2)^], which 
through the normalization of </>n(p) allows one to determine N~^ = I^j^{2mEn)-, 

one obtains 


{<i>nn{Pl))EE = 


hN-?,{2mEn - Pi) 

/3Ar(2m£'„) 


r(3V/2) 


r[3(V — l)/2] \2TTmE, 


1 


1 - 


P? 


2mEr, 


(3iV-5) 


(36) 

Defining a microcanonical temperature using En = 3NkBTn/2, where ks is the Boltz¬ 
mann constant, and taking the limit V —>■ oo one gets 


i(Pi))e£; = 


1 


2'KmkBT„ 


exp 


P? 


2mkBT„ 


(37) 


where we used that lim 7 v^oor(V -|- B)/[T{N)N^] = 1, which is valid for B € H. We 
immediately see that the result obtained for {4>nn{Pi)) ee is the Maxwell-Boltzmann 
distribution of momenta in a thermal gas. 

One can go a step further and show that the fluctuations of 4>nn{pi) about {4>nniPi)) ee 
are exponentially small in the number of particles m- Furthermore, it can be shown that 
if one requires the wave function '!/’n(x) to be completely symmetric or completely anti¬ 


symmetric one obtains, instead of the Maxwell-Boltzmann distribution in Fq. (37), the 


(canonical) Bose-Einstein or Fermi-Dirac distributions, respectively |27]. An approach 
rooted in RMT was also used by Flambaum and Izrailev to obtain, starting from sta¬ 
tistical properties of the structure of chaotic eigenstates, the Fermi-Dirac distribution 
function in interacting fermionic systems [Ml [92]. These ideas underlie the eigenstate 
thermalization hypothesis, which is the focus of this review. 

In closing this section, let us note that formulating a slightly modified conjecture one 
can also recover the classical limit from the eigenstates. Namely, one can define a different 
coarse-graining procedure for the Wigner function: 


(W(x,p)) = 


1 


Nse 


lUm(x,p), 


(38) 




where the sum is taken over all, Nse, eigenstates in a window 5E, which vanishes in the 
limit h ^ 0 but contains an exponential (in the number of degrees of freedom) number 
of levels. We anticipate that in the limit fi —)• 0 the function (IU(x, p)) also reduces to 


the right-hand-side (RHS) of Eq. (30), that is. 


(W(x,p)) = 


f d^^Xd^^PSlE - B(X, P)] 


dlE-H(X,P) 


(39) 


However, this result does not require the system to be ergodic. While this conjecture 
has little to do with chaos and ergodicity, it suggests a rigorous way of defining classi¬ 
cal microcanonical ensembles from quantum eigenstates. This is opposed to individual 
quantum states, which as we discussed do not have a well-defined classical counterpart. 
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3. Quantum Chaos in Physical Systems 

3.1. Examples of Wigner-Dyson and Poisson Statistics 

Random matrix statistics has found many applications since its introduction by Wigner. 
They extend far beyond the framework of the original motivation, and have been inten¬ 
sively explored in many fields (for a recent comprehensive review, see Ref. [H3] ) ■ Examples 
of quantum systems whose spectra exhibit Wigner-Dyson statistics are: (i) heavy nuclei 
[94j . (ii) Sinai billiards (square or rectangular cavities with circular potential barriers in 
the center) [85], which are classically chaotic as the Bunimovich stadium in Fig. (hi) 
highly excited level^of the hydrogen atom in a strong magnetic field |95|, (iv) Spin-1/2 
systems and spin-polarized fermions in one-dimensional lattices |69L [70] . Interestingly, the 
Wigner-Dyson statistics is also the distribution of spacings between zeros of the Riemann 
zeta function, which is directly related to prime numbers. In turn, these zeros can be 
interpreted as Fisher zeros of the partition function of a particular system of free bosons 
(see Appendix [B|) . In this section, we discuss in more detail some examples originating 
from over 30 years of research. 

Heavy nuclei - Perhaps the most famous example demonstrating the Wigner-Dyson 
statistics is shown in Fig. That figure depicts the cumulative data of the level spacing 
distribution obtained from slow neutron resonance data and proton resonance data of 
around 30 different heavy nuclei inillej. All spacings are normalized by the mean level 
spacing. The data are shown as a histogram and the two solid lines depict the (GOE) 
Wigner-Dyson distribution and the Poisson distribution. One can see that the Wigner- 
Dyson distribution works very well, confirming Wigner’s original idea. 



Figure 2. Nearest neighbor spacing distribution for the “Nuclear Data Ensemble” comprising 1726 spacings 
(histogram) versus normalized (to the mean) level spacing. The two lines represent predictions of the random 
matrix GOE ensemble and the Poisson distribution. Taken from Ref. [96]. See also Ref. ED. 


Single particle in a cavity - Next, let us consider a much simpler setup, namely, the 
energy spectrum of a single particle in a cavity. Here, we can contrast the Berry-Tabor 
and BGS conjectures. To this end, in Fig. we show the distribution of level spacings 
for two cavities: (left panel) an integrable rectangular cavity with sides a and b such 
that a/b = ^ and ab = dvr and (right panel) a chaotic cavity constructed from two 
circular arcs and two line segments (see inset) (HOj. These two plots beautifully confirm 
the two conjectures. The distribution on the left panel, as expected from the Berry-Tabor 
conjecture, is very well described by the Poisson distribution. This occurs despite the fact 


®The low-energy spectra of this system exhibits Poissonian level statistics. This is understandable as, at low 
energies, the motion of the equivalent classical system is regular ESj. See also Fig.jil 
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Figure 3. (Left panel) Distribution of 250,000 single-particle energy level spacings in a rectangular two- 
dimensional box with sides a and h such that a/b = and ab = Air. (Right panel) Distribution of 50,000 
single-particle energy level spacings in a chaotic cavity consisting of two arcs and two line segments (see inset). 
The solid lines show the Poisson (left panel) and the GOE (right panel) distributions. From Ref. m- 


that the corresponding classical system has only two degrees of freedoms [recall that in 
the argument used to justify the Berry-Tabor conjecture, Eqs. (21)-(23), we relied on 
having many degrees of freedom]. The right panel depicts a level distribution that is in 
perfect agreement with the GOE, in accordance with the BGS conjecture. 

Hydrogen atom in a magnetic field - A demonstration of a crossover between Pois¬ 
son statistics and Wigner-Dyson statistics can be seen in another single-particle system 
- a hydrogen atom in a magnetic field. The latter breaks the rotational symmetry of the 
Goulomb potential and hence there is no conservation of the total angular momentum. As 
a result, the classical system has coexistence of regions with both regular (occurring at 
lower energies) and chaotic (occurring at higher energies) motion [98]. Results of numeri¬ 
cal simulations (see Eig.[^ show a clear interpolation between Poisson and Wigner-Dyson 
level statistics as the dimensionless energy (denoted by E) increases [95]. Note that at 
intermediate energies the statistics is neither Poissonian nor Wigner-Dyson, suggesting 



Figure 4. The level spacing distribution of a hydrogen atom in a magnetic field. Different plots correspond to 
different mean dimensionless energies E, measured in natural energy units proportional to where B is the 

magnetic field. As the energy increases one observes a crossover between Poisson and Wigner-Dyson statistics. 
The numerical results are fitted to a Brody distribution (solid lines) m, and to a semi-classical formula due to 
Berry and Robnik (dashed lines) EZ). From Ref. m- 
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that the structure of the energy levels in this range is richer. In the plots shown, the 
numerical results are fitted to a Brody distribution (solid lines) [57], which interpolates 
between the Poisson distribution and the GOE Wigner surmise, and to a semi-classical 
formula due to Berry and Robnik (dashed lines) [^. We are not aware of a universal 
description of Hamiltonian ensembles corresponding to the intermediate distribution. 
Lattice models - As we briefly mentioned earlier, RMT theory and the Berry-Tabor 
conjecture also apply to interacting many-particle systems that do not have a classical 
counterpart. There are several models in one-dimensional lattices that fall in this cate¬ 
gory. They allow one to study the crossover between integrable and nonintegrable regimes 
by tuning parameters of the Hamiltonian |33j . A few of these models have been studied 
in great detail in recent years [emiToiiMHoi]. Here, we show results for a prototypical 
lattice model of spinless (spin-polarized) fermions with nearest and next-nearest neigh¬ 
bor hoppings (with matrix elements J and J', respectively) and nearest and next-nearest 
neighbor interactions (with strengths V and E', respectively) [69]. The Hamiltonian can 
be written as 


j = l L 


-J yfjfj +1 + H.c.j +V\n^--] I n4M^ - - 
-J' {fjfj +2 + H.c.) + E' (hj - M - I 


(40) 


where fj and /| are fermionic annihilation and creation operators at site j, hj 


= fU, 

the occupation operator at site j, and L is the number of lattice sites. Periodic boundary 
conditions are applied, which means that = /i and ^ f 2 - classical limit 

for this model can be obtained at very low fillings and sufficiently high energies. In 
the simulations presented below, the filling [N/L) has been fixed to 1/3. Therefore, 
quantum effects are important at any value of the energy. In this example, we approach 
a dense energy spectrum, and quantum chaos, by increasing the system size L. The 


IS 


Hamiltonian (40) is integrable when J' = V' = 0, and can be mapped (up to a possible 
boundary term) onto the well-known spin-1/2 XXZ chain 


It is important to stress that Hamiltonian (40) is translationally invariant. This means 


that when diagonalized in quasi-momentum space, different total quasi-momentum sec¬ 
tors (labeled by k in what follows) are decoupled. In addition, some of those sectors 
can have extra space symmetries, for example, k = 0 has reflection symmetry. Finally, 
if J' = 0, this model exhibits particle-hole symmetry at half-filling. Whenever carrying 
out an analysis of the level spacing distribution, all those discrete symmetries need to 
be accounted for, that is, one needs to look at sectors of the Hamiltonian that are free 
of them. If one fails to do so, a quantum chaotic system may appear to be integrable 
as there is no level repulsion between levels in different symmetry sectors. All results 
reported in this review for models with discrete symmetries are obtained after properly 
taking them into account. 

In Fig.[^a)-[^g), we show the level spacing distribution P{uj) of a system described by 
the Hamiltonian ( |40[ ), with L = 24 (see Ref. |69j for further details), as the strength of 
the integrability breaking terms is increased. Two features are immediately apparent in 
the plots: (i) for J' = V' = 0, i.e., at the integrable point, P{oj) is almost indistinguish¬ 
able from the Poisson distribution and (ii) for large values of the integrability breaking 
perturbation, P{uj) is almost indistinguishable from a Wigner-Dyson distribution [GOE 
in this case, as Eq. (40) is time-reversal invariant]. In between, as in Fig. there is a 
crossover regime in which the distribution is neither Poisson nor Wigner-Dyson. How¬ 
ever, as made apparent by the results in panel (h), as the system size increases the level 
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Figure 5. (a)—(g) Level spacing distribution of spinless fermions in a one-dimensional lattice with Hamiltonian 

( |40| |. They are the average over the level spacing distributions of all fc-sectors (see text) with no additional 
symmetries (see Ref. m for details). Results are reported for L = 24, N = L/3, J = V = 1 (unit of energy), and 
J' = V' (shown in the panels) vs the normalized level spacing u). The smooth continuous lines are the Poisson 
and Wigner-Dyson (GOE) distributions, (h) Position of the maximum of P(aj), denoted as ojmax^ vs J' = V^ for 
three lattice sizes. The horizontal dashed line is the GOE prediction. Adapted from Ref. m- 


spacing statistics becomes indistinguishable of the RMT prediction at smaller values of 
the integrability breaking parameters. This suggests that, at least for this class of mod¬ 
els, an infinitesimal integrability breaking perturbation is sufficient to generate quantum 
chaos in the thermodynamic limit. Recent numerical studies have attempted to quantify 
how the strength of the integrability breaking terms should scale with the system size 
for the GOE predictions to hold in one dimension |105l I106j . These works suggest that 
the strength needs to be oc L~^ for this to happen, but the origin of such a scaling 
is not understood. Moreover, it is unclear how generic these results are. In particular, 
in disordered systems that exhibit many-body localization, it has been argued that the 
transition from the Poisson to the Wigner-Dyson statistics occurs at a hnite value of the 
interaction strength. This corresponds to a finite threshold of the integrability breaking 
perturbation even in the thermodynamic limit (see Ref. m and references therein). 


3.2. The Structure of Many-Body Eigenstates 


As we discussed in Sec.[^ RMT makes important predictions about the random nature of 
eigenstates in chaotic systems. According to Eq. (12), any eigenvector of a matrix belong¬ 


ing to random matrix ensembles is a random unit vector, meaning that each eigenvectors 
is evenly distributed over all basis states. However, as we show here, in real systems the 
eigenstates have more structure. As a measure of delocalization of the eigenstates over a 
given fixed basis one can use the information entropy: 


S™ = - 


E 


d P In I P 

^m\ l^ml 5 


(41) 


where 


m) = Y^ 

i 



(42) 


is the expansion of the eigenstate |m) over some fixed basis \i). For the GOE, this entropy, 
irrespective of the choice of basis, should be Sgoe = ln(0.48P) -|- 0{l/'D) |93], where 
P is the dimensionality of the Hilbert space. However, numerical analyses of various 
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Figure 6. Information entropy (normalized using the GOE prediction) of the eigenstates of spinless fermions in a 
one-dimensional lattice with Hamiltonian l |40[ |. Results are reported for L = 18, 21, and 24, N = L/3, J = V = 1 
(unit of energy), and J' = V' (reported in the panels) vs the energy of the eigenstates. The information entropy 
is calculated in the basis of the eigenstates of the integrable Hamiltonian {J = V = I and J' = V' = 0), and in 
the k = 2 quasi-momentum sector (X> is the number of states in that sector). See also Ref. m- 


physical systems indicate that is only generically bounded from above by the RMT 
prediction [691 lir)2t IlDTj . This situation is characteristic of both few-particle and many- 
particle systems. For concreteness, we will illustrate this using the eigenstates of the 
Hamiltonian (40) (see Ref. |69] for details). For the fixed basis |i), we use the eigenstates 
of the integrable limit of this Hamiltonian, corresponding to J' = V' = 0. The results 
of the numerical simulations for the normalized Shannon entropy Sm/Sco^ are shown 
in Fig. [we note that Sm and S'goe were computed within a single quasi-momentum 
sector of the translationally invariant Hamiltonian (40)]. It is clear from the figure that 
the entropy of the states in the middle of the spectrum approaches the RMT prediction as 
the strength of the integrability breaking perturbation and the system size increase, while 
the states near the edges of the spectrum remain “localized”. The latter, namely, that the 
lowest and highest (if the spectrum is bounded) energy states are usually non-chaotic, is 
a generic feature of physical systems with few-body interactions and no randomness. 

Another implication of RMT is that the eigenstates of different Hamiltonians are es¬ 
sentially uncorrelated random vectors. This, of course, cannot be literary true in physical 
systems. Indeed, let us consider a family of Hamiltonians characterized by some continu¬ 
ous parameters like H{J' ,V') in Eq. (40). If we change J' —)> J' + dJ' and V —)• V + 6V\ 
then, obviously, for sufficiently small changes, 5J' and 5V\ the eigenstates of the Hamil¬ 
tonians H{J', V) and H{J' + 5J', V + 5V') will be almost the same. However, one can 
anticipate that a very small parameter change, likely vanishing exponentially with the 
system size, is sufficient to mix different eigenstates of the original Hamiltonian with 
nearby energies such that new eigenstates look essentially random in the old basis. This 
is indeed what is seen in the numerical simulations. In Fig. [^a), we show the scaled 
information entropy 5 '/S'goe of the eigenstates of H{J' + 5J\ V + 5V') in the basis of 
H{J' ,V') as a function of 6J' = 5V'. These results show that at fixed values of 5J' and 
5V', the information entropy rapidly increases with the system size. In Fig.j^b), we show 
the same entropy plot vs the integrability breaking perturbation, but now scaled by a 
power of the mean level spacing 5J'/{6e)^ = 5V'/{6e)°‘. We found numerically that there 
is good data collapse for a « 0.43. While it is necessary to study much larger system sizes 
to determine the exponent a accurately, for the system sizes available to us it is already 
apparent that the relevant strength of the integrability breaking perturbation needed 
for a complete randomization of the energy levels is exponentially small in the system 
size. Indeed be oc exp[—5(E)], where 5(E) is the thermodynamic entropy of the system, 
which scales linearly in the system size, and E is the average energy of the eigenstates 
for which the information entropy is computed. 
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Figure 7. Average information entropy (normalized using the GOE prediction) of the eigenstates of spinless 
fermions in a one-dimensional lattice with Hamiltonian The average is computed over the central 10% of 

the energy spectrum. Results are reported for L = 18, 21, and 24, N = L/3, J = V = 1 (unit of energy), 
and J' = V' as one departs from J' = V' = 0.5. The information entropy is calculated in the basis of the 
eigenstates of the nonintegrable Hamiltonian with J = V = 1 and J' = V' = 0.5, and in the k = 2 quasi¬ 
momentum sector (T> is the number of states in that sector), (a) The average information entropy is reported as 
a function of SJ' = J' — 0.5 = 5V' = V' — 0.5. (b) The average information entropy is reported as a function of 
= 5V'/^ where 5e is the average level spacing between the eigenstates used to compute the 
average entropy. 


3.3. Quantum Chaos and Entanglement 

So far, we have discussed manifestations of quantum chaos in the statistics of level spac- 
ings and in the properties of many-body Hamiltonian eigenstates. At the same time, 
as we discussed in Sec. classical chaotic systems do not have a well-defined analogue 
of stationary eigenstates because they do not have closed stationary orbits. Chaos in 
classical systems is usually defined as the exponential divergence in time of nearby tra¬ 
jectories. But this language does not apply to quantum systems, which do not have a 
well-defined notion of a trajectory. So, it seems that there is a fundamental discrepancy 
between the quantum and classical ways of defining chaos. Nevertheless, this discrepancy 
is somewhat superficial and one can treat quantum and classical chaos on the same foot¬ 
ing by analyzing delocalization of the system either in phase space or in energy space, 
and using appropriate entropy measures to characterize this delocalization. Using such 
measures, it is possible to smoothly interpolate between quantum and classical regimes 
in chaotic systems and analyze various quantum to classical crossovers. However, some 
care is needed in defining such measures. To this end, here we first discuss the problem 
for classical systems and then extend the ideas to quantum systems. 

Let us consider a setup in which the system is prepared in some initial state and 
is allowed to evolve according to some time-independent Hamiltonian H. If the initial 
state is a stationary state (namely, a stationary probability distribution) of some initial 
Hamiltonian Hq 7 ^ H then this is what is usually called a quench. For example, one can 
consider a gas of particles in thermal equilibrium in a recipient with a given volume, and 
then one suddenly doubles the volume of the recipient, for example, by moving a piston. 
Alternatively, one can consider an equilibrium system of spins (classical or quantum) in 
which one suddenly changes a magnetic field or the coupling between the spins. A strong 
physical manifestation of chaos in classical systems is delocalization in the available phase 
space after the quench. A standard measure of this delocalization is the entropy, which 
is defined in phase space as 

-S' = - y J /^(x, P) ln[p(x, p)] , (43) 

where /?(x, p) is the classical probability distribution, D is the dimensionality of the phase 
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space, and the usual factor of {2'Kh)^ is introduced to make the integration measure 
dimensionless. This entropy is maximized when p(x, p) is uniform in the available phase 
space. If the system is isolated, then according to Liouville’s theorem the entropy (43) 
is conserved in time |108j . This is a consequence of the incompressibility of classical 
trajectories, which implies that the phase-space volume occupied by any closed system 
does not change in time. Liouville’s theorem and the lack of the entropy increase was a 
topic of controversy for a long time, since Boltzmann introduced his H-theorem. 

To circumvent this problem and use entropy as a measure of delocalization in the 
available phase space, one can analyze the reduced probability distribution of Na particles 
obtained by averaging over the positions and momenta of the remaining N—Na particles, 

Pa(xi, . . . ,XAr^,pi, . . . 

dyiNA+idPNA+i ■ • •dxAr(ipAr/9(xi,... ,XAr,Pi,.. • ,PAr,t) , (44) 


and compute the entropy of this reduced probability distribution. This entropy is not 
restricted by Liouville’s theorem and after a quench, for sufficiently large subsystems of 
an ergodic system (and for Na 'C N), it is expected to increase in time to the maximum 
value given by the Gibbs distribution. 

In quantum systems, the situation is remarkably similar. Instead of a probability dis¬ 
tribution one deals with a density matrix p. A direct analogue of the classical (Liouville) 


entropy (43) is the von Neumann entropy: 


5vn = -Trlplnp]. 


(45) 


Similar to classical systems, the von Neumann entropy is conserved in time for isolated 
systems, which is a simple consequence of unitary evolution. Hence, extending the analogy 
to classical systems, we can define the reduced density matrix of a quantum system using 
a partial trace (typically, one traces over a region in real space): 


PA = TrB[p] = ^ |nA)(n(4| ^(nA,ras|p|n(4,nB), 
nA,n'A 


(46) 


where {ha) and {ns) are the complete basis states of the subsystems A and B, respec¬ 
tively. One can then define the von Neumann entropy of the reduced density matrix 

= -Tta [pA In pa] ■ (47) 

If the full density matrix is that of a pure state, that is, p = then this entropy 

5^ is also called the entanglement entropy. The entanglement entropy has been studied 
in the context of quenches and thermalization in clean interacting systems [10911113] . as 
well as in disordered systems in the context of many-body localization following quantum 
quenches and in the presence of a periodic drive [11414118] . 

These ideas were recently tested in experiments with small systems involving super¬ 
conducting qubits [119j and ultracold atoms We will discuss the superconducting 

qubit experiment in the next section. Here, we review the results of the ultracold atom 
experiment. There, the authors prepared two identical chains each with six sites. The 
Hamiltonian describing the system is 

H = ^ + H.c.) - Jy + H-c.), (48) 

ij ij i 
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Figure 8. (Left panel) Second Renyi entanglement entropy vs subsystem size for a six-site Bose-Hubbard 
chain measured at long times after a quench (red symbols) and in the ground state (blue symbols). Red and 
blue lines following the data points depict the theoretical predictions, while the gray (straight) line depicts the 
theoretical prediction for the Renyi entropy of the system in thermal equilibrium. From Ref. mi- (Right panel) 
Entanglement entropy as a function of a subsystem size for different representative eigenstates of the spin-1/2 
Hamiltonian ( |50| l for L = 20. The inverse temperature (3 in both panels is obtained by matching the energies to 
those of systems in thermal equilibrium. The entanglement entropy grows linearly with when and (3 are 
small, and coincides with the equilibrium entropy of the Gibbs ensemble. From Ref. m- 


where i = 1,..., 6 is the site coordinate along the x-direction and j = 1,2 is the site 
coordinate along the y-direction. As usual, ajj and a-j are the boson creation and anni¬ 
hilation operators, respectively, and hj ,■ = a] -a, ,• is the site occupation number. 

The system was initialized in a Fock state with exactly one particle per site and both 
Jx and Jy being essentially equal to zero. At time t = 0, the tunneling along the x- 
direction [Jx) was quenched to Jx/U ~ 0.64, with Jy remaining negligible. The system 
was then allowed to evolve. This way, two identical copies of a many-body state were 
created. Implementing a swap operation |1211I122] . it was possible to measure the second 
Renyi entanglement entropy for each chain: 

S 2 =- In [Trp^] . (49) 

The latter is very similar to the von Neumann entanglement entropy. If the system 
is quantum chaotic, is expected to coincide with the corresponding entropy in the 
thermal ensemble for L^jL <1/2 (up to finite-size corrections). We note that, in general, 
bounds 5^ from above. The two entropies are equal for (maximally entangled) infinite 
temperature states and (non-entangled) product states. 

In the left panel in Figure we show the measured long-time result of the second 
Renyi entanglement entropy after the quench as a function of the subsystem size. It is 
remarkable that, even for such a small system, S 2 is very close to the entropy of a Gibbs 
ensemble with the same mean energy for the smallest subsystem sizes. This experiment 
shows that, even in small quantum systems, one can see clear signatures of quantum 
chaotic behavior. 

Next, it is important to discuss theoretical predictions that closely follow the exper¬ 
imental hndings. We focus on results for the spin-I/2 transverse field Ising chain, with 
Hamiltonian: 


L L-l L-l 

H = ^g^j + h&j + {h-J){al + al) + jY^ ^j^j+i■ ( 50 ) 

j=l j=2 j=l 

This model exhibits quantum chaos in the parameter range studied in Figs. (right 


25 

















August 2, 2016 


Advances in Physics Review 



Figure 9. von Neumann’s entropy of the reduced density matrix obtained after tracing one half of a spin-1/2 
chain as a function of time (see text for details). The initial state corresponds to a product state of randomly 
polarized spins. The entropy grows linearly in time and saturates at a value which is very close to the maximum, 
corresponding to the infinite temperature state: S'max = I/ln2/2 — 1/2. From Ref. 


panel) and § h = (a/S + l)/4, g = (aA + 5)/8, and J = 1. 

In the right panel in Fig. 8| we show the entanglement entropy for representative eigen¬ 
states of the Hamiltonian ( 5M as a function of the subsystem size [123j . Different curves 
are labeled according to the temperature of the Gibbs ensemble that has the same mean 
energy as the eigenstate. For small subsystem sizes, the entanglement entropy is clearly 
a linear function of the subsystem size (as in the experimental results for the second 
Renyi entanglement entropy shown in the left panel). Moreover, the slope is identical to 
the slope of the equilibrium entropy. As the subsystem size increases, the entanglement 
entropy deviates from the equilibrium result and the deviation increases as the effective 
temperature decreases. In Ref. those deviations were argued to be subextensive in 

L for any fixed ratio La_/L < \l‘l. This means that, in the thermodynamic limit and 
for any nonvanishing effective temperature, the entanglement entropy of eigenstates in 
quantum chaotic systems is expected to have a triangular shape as a function of Ta/A, 
with a cusp at La/A = 1/2. This is a result of the eigenstate thermalization phenomenon 
that we discuss in Sec. IH 

More directly related to the experiments in Ref. m, in Fig. [^we show the temporal 
evolution of the entanglement entropy obtained numerically after tracing out one half 
of the lattice in the transverse field Ising model (50). The initial state corresponds to 
a product of spins with random orientations. Such a state has zero initial entropy. As 
seen in Fig. the entropy grows linearly in time and then saturates (as expected) 
close to that of a random pure state HH: S* = Lin2/2 — 1/2. This is exactly the result 
obtained in the right panel in Fig. [^for /? = 0. Following the findings in Fig. one can 
anticipate that if one studies a classical ergodic spin chain, instead of a quantum spin 
chain, and begins the dynamics from a factorized probability distribution one would get 
a similar increase of the Liouville entropy of one half of the system. 


3.4. Quantum Chaos and Delocalization in Energy Space 

Another way to reveal delocalization of classical systems in available phase space is to 
study the time-averaged probability distribution over a time interval to 

1 

Pio(x,P) = — / /3(x,p,t) (51) 

*0 Jo 
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and compute the entropy of this distribution. Because the negative logarithm is a convex 
function, using Jensen’s inequality, it is straightforward to see that such an entropy can 
only increase as a function of to- For ergodic systems, it is expected that this entropy will 
increase to its maximally allowed value, that is, to the microcanonical entropy, because 
the system on average visits all points in phase space with equal probability. For non- 
ergodic systems, conversely, the system is expected to remain more localized in phase 
space even after time averaging, so that the entropy never reaches the microcanonical 
value. 

Continuing the analogy with classical systems, a second possibility to use entropy to 
quantify quantum delocalization is to study the entropy of the time-averaged density 
matrix. Assuming that there are no degeneracies, the off-diagonal matrix elements of the 
density matrix in the basis of the Hamiltonian oscillate in time according to [77j: 

Pmnit^ — Pruniio) to)] 


Therefore, in the quantum language, time averaging is equivalent to projecting the initial 
density matrix onto the diagonal subspace of the Hamiltonian, leading to what is known 
as the diagonal ensembl^ density matrix [28]: 

pta _^ 

PD^ = P= lim — / p{t)dt =y prnrn\'m){m\. (53) 

to^oo to Jo ^ 


Thus studying delocalization of the classical probability distribution in phase space at 
long times is equivalent, in the quantum language, to studying the spreading of the initial 
density matrix in the basis of the eigenstates of the Hamiltonian, or, simply, in energy 
space. From the discussion in Sec. 3.2, one can expect that the diagonal density matrix 


will generically be delocalized for quantum chaotic systems. For integrable systems, on 
the other hand, the diagonal density matrix can be more (or less) localized depending 
on the initial state. 

The analogy between delocalization in energy space and classical chaos was recently 
explored experimentally in a system of three coupled superconducting qubits mi, which 
effectively represent three 1/2 spins. The experiment was carried out in the sector where 
the effective total spin is 5* = 3/2, and focused on periodic kicks with: 






(54) 


where Sy and Sz are spin operators. Like the kicked rotor model, this system in the 
5—7-00 classical limit has a mixed phase space with both chaotic and regular trajecto¬ 
ries. In the experiment, the quantum system was initialized in a coherent state centered 
around some point in the two-dimensional phase space. The system was then allowed to 
evolve under kicks and, after long times, both the entanglement entropy (of one qubit) 
and the diagonal entropy (the entropy of the time-averaged density matrix) were mea¬ 
sured through quantum tomography. The results were contrasted with the phase-space 
dynamics of the 5 —)• oo classical limit. 

we show the dynamics of the entanglement entropy (left panels) and the 


In Fig. 10 


entropy of the density matrix averaged over several periods, which is equivalent to the 
Floquet diagonal entropy (middle panels). The entropies are reported (in a color scale) 
for different initial coherent states centered around spherical angles (0O)</’o)- The right 


®The diagonal ensemble will play a crucial role throughout this review. 
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Figure 10. Left panels: entanglement entropy averaged over three different qubits after a different number of kicks 
N. In the color scale used, red signals a small entropy and blue signals the maximum entropy [ln(2)]. Different 
points on each panel correspond to different initial conditions, which are coherent states centered around the 
phase-space point Middle panels: Floquet diagonal entropy, which was computed as the entropy of the 

full density matrix averaged over 20 kicks (top) and 10 kicks (bottom). Right panels: phase-space portrait of the 
classical system. The top and bottom panels depict results for different kick strengths: hi = 0.5 and k = 2.5, 
respectively. From Ref. Em¬ 


panels show the phase space portraits of the corresponding classical systems. Both the 
entanglement entropy and the Floquet diagonal entropy show strong correlations with 
classical regions of chaotic and non-chaotic motion, with higher entropy corresponding 
to more chaotic behavior. Interestingly, these correlations persist deep in the quantum 
regime {S = 3/2 is not particularly large). This experiment illustrates the ideas discussed 
in this and the previous section, namely, that quantum chaos results in delocalization of 
either the reduced density matrix of subsystems or the time-averaged density matrix of 
the full system. 

To illustrate delocalization of an initial wave function among energy eigenstates in a 
larger quantum chaotic system, we follow Ref. [ESI, which reported results for quan¬ 
tum quenches in one-dimensional periodic chains of interacting spinless fermions with 


Hamiltonian (40), and hard-core bosons with Hamiltonian 




-J (+ H.c. ] + V [ hi - ^ ] 1 hi,. - ^ 


b i+i 


1 


^ 2 


1 


b+i 


-/ i+ H.c.) + ( hj - fh ^^2 - ^ 


(55) 


where 6,- and b 
b^b 


.f 


j ..j are hard-core bosons annihilation and creation operators at site j, hj = 

is the occupation operator at site j, and L is the number of lattice sites. Hard¬ 
core bosons satisfy the same commutation relations as bosons but have the constraints 
= (b\)'^ = 0, which preclude multiple occupancy of the lattice sites [33]. For J' = 0, the 


hard-core boson Hamiltonian (55) can be mapped onto the spinless fermion Hamiltonian 


(40), up to a possible boundary term |33|. Like the spinless fermion Hamiltonian (40), 


the hard-core boson one (55) is integrable for J' = V' = 0 and nonintegrable otherwise. 


In Fig. El we show the normalized energy distribution 


P{E) = Y,PmbiE-Em), 


(56) 
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Figure 11. Normalized energy distribution function P(-E), see Eq. | |56[ |; after quenches in hard-core boson chains. 
Results are presented for two different initial states |-?/’/} corresponding to eigenstates of Hamiltonian ( |55[ l with J/ = 
0.5, V/ = 2 (top panels) and Jj = 2, Vj = 0.5 (bottom panels). The final parameters of the Hamiltonian are J = 
V = 1. J' = V' remain unchanged during the quench. Their values are indicated in the figure. Smooth solid lines: 
best Gaussian fit to exp[—(E—6)^/(2a^)] for the parameters a and 6; dashed lines: exp[—(E— 

E)^/(25E^)], where E = {'ipjlHl'ipj) and SE^ = — {■tpj\H\'ipj)^ is the energy variance after the quench. 

From Ref. 


where 


Pm = , (57) 

and is the projection of the initial state |'0/) on eigenstate \m). The results 

presented are for quenches in the hard-core boson chain. The top and bottom panels 
correspond to different initial states. The parameters of the hnal Hamiltonian are J = 
H = 1, and J' = V' with the values indicated in the figure (increasing from left to 
right). Those plots make apparent that as the system becomes more quantum chaotic 
(larger J' = V'), the energy distribution becomes less sparse, which means that the initial 
state becomes more delocalized among the eigenstates of the final Hamiltonian. Another 
visible feature of the energy distribution is that in chaotic systems it rapidly approaches 
a Gaussian centered around the mean energy E = {'^pJ\H\^l;J) and the width given by the 
variance of the energy in the initial state 5E'^ = . Similar results 

were obtained for quenches in the spinless fermion chain |125j and in other many-body 
Hamiltonians |101l 110211126j . 

For states that are eigenstates of some Hamiltonian Hj and are decomposed in the 
eigenstates of a new Hamiltonian Hp (as we did above), the normalized energy distri¬ 
bution is also known as the strength function |101l 11021112711129] . If 77^? is taken to be 
Hj plus a perturbation, it has been shown that the normalized energy distribution (the 
strength function) evolves from a Breit-Wigner form to a Gaussian form as the strength 
of the perturbation is increased |101( I102( 1129] . The transition between those distribu¬ 
tions, say, for a given state |i) (an eigenstate of Hj), has been argued to occur as the 
average value of the nonzero off-diagonal matrix elements \ {i\Hp\j) \ becomes of the same 
order of (or larger than) the average level spacing of the states \j) (also eigenstates of 
Hj) for which \{i\Hp\j)\ / 0 |101j . Hence, if Hp is quantum chaotic one expects that, 
provided that the system is large enough (the density of states increases exponentially 
fast with increasing system size), the normalized energy distribution will have a smooth 
Gaussian form after a quench independent of the nature of Hj. It has been recently shown 
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J’=V’ J’=V’ 


Figure 12. Entropies in hard-core boson chains with L = 24 (main panels) and L = 21 (insets). Results are 
reported for the diagonal entropy (Sd), the microcanonical entropy (S'm), and the canonical entropy (Sc), after 
quenches as those described in Fig. [E for initial states corresponding to eigenstates of Hamiltonian ( |55| l with 
Jj = 0.5, V/ = 2 (a) and Jj = 2, V/ = 0.5 (b). The final parameters of the Hamiltonian are J = V = 1. J' = V' 
remain unchanged in the quench and their values are depicted in the a:-axis. Thin lines joining the points are 
drawn to guide the eye. See also Ref. [T25]. 


that this is not the case in generic (experimentally relevant) quenches to integrability. 


Namely, ii Hp is integrable, the distribution of pm [see Eq. (57)] ends up being sparse 
even in quenches whose initial states are thermal states of nonintegrable Hamiltonians 
m (see Sec. |8.3|). The sparseness of pm at integrability is apparent in Fig. [fljand 


for 


much larger system sizes, has also been explicitly shown in Refs. |131H133] . 

To quantify the level of delocalization one can use the von Neumann entropy of the 
diagonal ensemblewhich is known as the diagonal entropy |125[ I134| : 


Sd = - Inpr; 


(58) 


This entropy is the same as the information entropy of the initial state in the basis of 
the Hamiltonian governing the evolution, which we analyzed in Fig. In particular, the 
entropy plotted in that figure is the diagonal entropy of the eigenstates of the Hamiltonian 
H{J'+6J', V' + 5V') in the basis of the Hamiltonian H{J', V'), averaged over eigenstates. 

In Fig. 12, we plot the diagonal entropy vs J' = V for quenches as those in 
Fig. [TT] and systems with T = 21 and L = 24. We also plot the microcanonical 
entropy Sm = ln[H(F)(5F], and the canonical entropy Sc = —Tr[/3cE InpcEj) where 
PCE = exp(—/3.H)/Tr[exp(—/3.f7)] and the inverse temperature f3 is fixed such that 
Tv{Hpcp) = E. In all the quenches in Figs. 11 and jlij the initial state was selected 
such that /3“^ 3J. Figure [l^ shows that, as one departs from the integrable point 

[J' = W = 0), the diagonal entropy becomes almost the same as the microcanonical 
entropy. This is up to finite-size effects, whose relevance to the results presented is made 
apparent by the differences between Sm and Sc- As the system size increases, Sm/L and 
Sc/L must approach each other as they are equal up to subextensive corrections. Nu¬ 
merical evidence that the diagonal entropy after a quench and the thermal equilibrium 
entropy are identical in nonintegrable systems in the thermodynamic limit has been ob¬ 
tained in numerical linked cluster expansion studies [Ml ESS]. We will explain why S^/L 


^One could also use the inverse participation ratio: IPR = ^ with pm defined as in Eq. ( |57| |. IPR = 1 

if only one state is occupied and it is maximized when all states are occupied with equal probability: IPR = T), 
where T) is the dimension of the available Hilbert space. 
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agrees with the entropy in thermal equilibrium in Sec. 5.3.1, when we discuss the funda¬ 
mental thermodynamic relation. On the other hand, in many numerical and analytical 
works it has been established that, in integrable systems, extensive differences generally 
occur between the diagonal entropy and the thermal entropy [nsi [THUi [muio] (see 
Sec. 8.3). In the spirit of our current discussion, this implies that integrable systems 
generally remain more localized in energy space. 


4. Eigenstate Thermalization 


4.1. Thermalization in Quantum Systems 

In 1929, von Neumann wrote a remarkable paper in which he discussed how statistical 
mechanics behavior could emerge in quantum-mechanical systems evolving under unitary 
dynamics m- As mentioned in the Introduction, one of von Neumann’s crucial insights 
was to focus on macroscopic observables, as opposed to focusing on the wave function or 
the density matrix of the entire system. He proved what he named the quantum ergodic 
theorem, which has been recently discussed in detail by Goldstein et al. in Ref. m- 
In the words of the latter authors, the quantum ergodic theorem (or “normal typical¬ 
ity” ((^states that ^^for a typical finite family of commuting macroscopic observables, every 
initial wave function from a microcanonical energy shell evolves so that for most times 
in the long run, the joint probability distribution of these observables obtained from the 
unitarily time-evolved wave funetion is close to their microeanonieal distribution^'. This 
theorem was a very important hrst step in the study of thermalization in quantum sys¬ 
tems. However, some shortcomings are immediately apparent given our discussion so far. 
For example, the theorem makes no distinction between integrable and nonintegrable 
systems, as such, it leaves one wondering about the role of integrability. Also, typical 
observables in von Neumann’s sense need not be relevant to experiments. As we discuss 
in Sec. 4.2.2 von Neumann’s theorem is related to RMT. Hidden in it is the seed for 


eigenstate thermalization |145| . which is the topic of this section. 

In the spirit of von Neumann’s theorem, in this review thermalization refers to observ¬ 
ables and is defined in a strong sense. Suppose that one prepares an isolated system in 
a nonstationary state with a well-defined mean energy, and subextensive energy fluctu¬ 
ations. An observable is said to thermalize if, during the time evolution of the system, 
it relaxes to the microcanonical prediction and remains close to it at most later times. 
Whether the isolated system is in a pure or mixed state is immaterial to the question of 
thermalization. 

To understand the essential ingredients needed for thermalization to occur, let us con¬ 
sider a simple setup in which an isolated system is initially prepared in a pure state 
and evolves under a time-independent Hamiltonian H. We assume that the Hamiltonian 
has eigenvectors \m) and eigenvalues Em, that is, H\m) = Em\m). The time-evolving 
wave function can be written as 




me 


(59) 


where Cm = (notice that we set h —>■ 1); and we are interested in f > 0. Obviously, 


®Not to be confused with canonical typicality [T32ifTi4l . which makes statements about the reduced density 
matrix of typical states in the microcanonical energy shell. As discussed in the introduction, experimental out- 
of-equilibrium states are atypical as a result of the way they are created. Canonical typicality does not tell how 
typical states can be reached. 

^Everything we discuss in what follows can be straightforwardly generalized to mixed states. 
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the density matrix of the system will remain that of a pure state at all times [/^(t)^ = p{t)], 
that is, it can never become a mixed (thermal) density matrix. Now, let us look at the time 
evolution of some observable O, which in the basis of the eigenstates of the Hamiltonian 
can be written as 


o{t) ^ {m\o\m) = 


Y^\cJ^Omm+ Y. 

m m, n^m 


(60) 


where Omn = (m|0|n). As stated before, we say that the observable O thermalizes if: (i) 
after some relaxation time, the average expectation value of this observable agrees with 
the microcanonical expectation value and (ii) temporal fluctuations of the expectation 
value about the microcanonical prediction are small at most later times. This implies 
that the long-time average accurately describes the expectation value of O at almost all 
times and agrees with the microcanonical prediction. 


The initial difficulties in reconciling these requirements with Eq. (60) are obvious. In 


the long-time average, the second sum in Eq. (60) averages to zero (provided there are 


no degeneracies, or that there is a nonextensive number of them) and we are left with the 
sum of the diagonal elements of O weighted by Some of the natural questions one 

can ask are: (i) Since the probabilities jCmP are conserved in time, how is it possible for 
Yhm IC'mPOmm to agree with the microcanonical average? (ii) Moreover, in many-body 
systems, the eigenenergies are exponentially close to each other and therefore, to make 


sure that the second sum in Eq. (60) averages to zero, one could potentially need to 


wait an exponentially (in system size) long time. Such a time, even for moderately small 
systems, could exceed the age of our universe, and therefore cannot be reconciled with 
the experimental observation that even large systems thermalize over much shorter time 
scales than the age of the universe (we observe them thermalize). 

Remarkably, if the Hamiltonian H was a true random matrix, then using the RMT 
prediction for observables [namely that 0mm is independent of m and that Omn for m ^ n 


is exponentially small in the system size, see Eq. (20)] one finds that the observables 
thermalize in the sense specified above. This is because the first sum in Eq. (60) becomes 
independent of the initial state 


E 


\Cm\"0, 


OY\Cn 


= o, 


(61) 


that is, it agrees with the microcanonical result. Note that within RMT, the microcanon¬ 
ical ensemble has no energy dependence and is thus formally equivalent to the infinite 
temperature ensemble. It also becomes clear that exponentially long times may not be 
needed for relaxation. The off-diagonal matrix elements of 0 are exponentially small so, 
by destroying phase coherence between a finite fraction of the eigenstates with a signif¬ 
icant contribution to the expectation value, it is possible to approach the infinite-time 
prediction with high accuracy in a time much shorter than the inverse (many-body) level 
spacing, which is required to destroy coherence between all eigenstates. We will come 
back to this later. The relevance of RMT for understanding thermalization in many-body 
quantum systems was discussed by Deutsch in a seminal paper in the early 1990s [26]. 
There, he essentially extended Berry’s conjecture to arbitrary quantum systems assuming 
that the eigenstates of ergodic Hamiltonians are essentially uncorrelated random vectors. 

In order to describe observables in experiments, however, one needs to go beyond the 
RMT prediction. This because, in contrast to random matrices, in real systems: (i) ther- 
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mal expectation values of observables depend on the energy density (temperature) of 
the systeirp^ and (ii) relaxation times are observable dependent. Hence, there is infor¬ 
mation in the diagonal and off-diagonal matrix elements of observables in real systems 
that cannot be found in RMT. In groundbreaking works throughout the 1990s, Srednicki 
provided the generalization of the RMT prediction that is needed to describe observables 
in physical systems [2711^ [TIB]. Srednicki’s ansatz is known as the ETH. It was hrst 
shown to apply to realistic quantum systems, where thermalization was observed for a 
strikingly small number of particles (5 bosons in 21 lattice sites), by Rigol et al. [28]. We 
should mention that, in a remarkable discussion of numerical experiments with 7 spins, 
Jensen and Shankar |147j advanced part of ETH [the first term on the RHS of Eq. (62)]. 
The smallness of the system they studied precluded them from observing a qualitatively 
different behavior between nonintegrable and integrable systems. 


4.2. The Eigenstate Thermalization Hypothesis (ETH) 

ETH can be formulated as an ansatz for the matrix elements of observables in the basis 
of the eigenstates of a Hamiltonian |29j : 


Omn — 0{E) 

^mn ^ 


(62) 


where E = (Em+En) /2, iv = En—Em, and S{E) is the thermodynamic entropy at energy 
E. Crucially, O [E^ and fo {E, to) are smooth functions of their arguments, the value 
O (E) is identical to the expectation value of the microcanonical ensemble at energy 
E and Rmn is a random real or complex variable with zero mean and unit variance 
{R^n = 1 or jRmnP = Ij respectively). While there is no rigorous understanding of 


which observables satisfy ETH and which do not, it is generally expected that Eq. (62) 


holds for all physical observables, namely, observables for which statistical mechanics 
applies (see, e.g., discussion in Ref. [108] 1. Specifically, ETH has been numerically verified 
for few-body observables in a variety of lattice models, no matter whether they are 
local or not (see Sec. 4.3.1). By few-body observables we mean n-body observables with 
n N, where N is the number of particles, spins, etc, in the system. This is the class 
of observables that can be experimentally studied in macroscopic systems. Projection 
operators to the eigenstates of the many-body Hamiltonian, Pm = |m)(m|, are operators 
for which Eq. (62), as well as the predictions of statistical mechanics, do not hold. In 
a recent study of lattice systems. Garrison and Grover argued that ETH can hold for 
observables with support in up to 1/2 of the system size |123j . 

The matrix elements of observables can be real or complex depending on the sym¬ 
metries of the Hamiltonian and the basis used to diagonalize it. If the system obeys 
time-reversal symmetry, the eigenstates of the Hamiltonian can be chosen to be real and 
so will be the matrix elements of observables (Hermitian operators). This is not possible 
if the system does not obey time-reversal symmetry. By taking the Hermitian conjugate 
of Eq. (62), we see that the function fo{E, uj) and the random numbers Rmn must satisfy 
the following relations 


Rnm = Rmn, fo{E, -uj) = fo{E,u}) (real matrix elements) 
R^m — Emn, fo{E,—io) = fo{E,uj) (complex matrix elements). 


(63) 


Srednicki’s ansatz (62) is similar to the RMT result in Eq. (20). The differences are: 


^®For simplicity, we assume that the energy is the only conserved quantity in the system. If there are other 
conserved quantities, they have to be treated in a similar fashion. 
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(i) The diagonal matrix elements of observables 0{E) are not the same in all eigenstates. 
Rather, they are smooth functions of the energy of the eigenstates, (ii) For the off- 
diagonal matrix elements, on top of the small Gaussian fluctuations, there is an envelope 
function fo{E, u) that depends on the mean energy and the energy difference between the 
eigenstates involved. This ansatz is consistent with results obtained in the semi-classical 
limit of quantum systems whose classical counterpart is chaotic 
The ETH ansatz reduces to the RMT prediction if one focuses on a very narrow energy 
window where the function fo{E,uj) is constant. In single-particle diffusive systems, this 
scale is given by the Thouless energy (see, e.g.. Ref. m), which is essentially equal to 
Planck’s constant divided by the diffusion time |153j : 


hD 


(64) 


where D is the diffusion constant and L is the linear size of the system. As we discuss in 
Sec. |4.3.1[ the same appears to be true in generic diffusive many-body quantum systems. 
Namely, that if one focuses on an energy shell of width ui < Et then fo{E,u}) ~ const, 
so that the ETH ansatz is identical to RMT. In other words, there is no structure in the 
eigenstates of ergodic Hamiltonians in an energy window narrower than the Thouless 
energy. As this window vanishes in the thermodynamic limit, RMT has a very limited 
range of applicability. Note, however, that the level spacing vanishes much faster with 
the system size. Therefore, there is still an exponentially large number of energy levels 
in the region where RMT applies. The situation can be more subtle in systems with 
subdiffusive, for example, glassy dynamics. One can anticipate that /(w) will saturate 
at w < hjT*, where t* is the slowest physical time scale in the system. As long as the 
corresponding energy window contains exponentially many energy levels, one expects 
that RMT will apply in this windowj^ Conversely, the ETH ansatz does not have these 
RMT limitations and is believed to apply to arbitrary energies with the exception of the 


edges of the spectrum. As we will see later, in Sec. 6.8, the dependence of fo{E,uj) on 
u! determines the decay of nonequal-time correlation functions. It also determines the 
relaxation time following a small perturbation about equilibrium (in the linear response 
regime) [29l I154j . In ergodic systems, it is expected that the diffusive time gives the 
slowest time scale in the system |155j . Within ETH, this follows from the fact that the 
function fo{E,uj) becomes structureless (constant) for u < Et- 


4-2.1. ETH and Thermalization 


The ETH ansatz (62) has immediate implications for understanding thermalization in 


many-body quantum systems. First, let us focus on the long-time average of observables. 
If there are no degeneracies in the energy spectrum, which is a reasonable assumption for 


generic quantum systems after removing all trivial symmetries, we obtain [using Eq. (60)] 


O = lim ^ f dt 0{t) = \C„ 

to-s-oo to Jo ^ 


-Q 


= Tr 


/5deO 


(65) 


a very recent work by Luitz and Bar Lev arXiv:1607.01012 , the Gaussian ansatz for the off-diagonal matrix 
elements was found to hold for diffusive spin chains. At the same time it was found that, in sub-diffusive disordered 
chains, the distribution of off-diagonal matrix elements becomes non-Gaussian (it acquires long tails). 
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where /Ode is the density matrix of the diagonal ensemble, dehned in Eq. (53). On the 
other hand, statistical mechanics predicts 


Ome = Tr 


pymO 


( 66 ) 


where /5 me is the density matrix of the microcanonical ensemble (due to ensemble equiv¬ 
alence one can, of course, use a canonical, or any other equilibrium, density matrix 
instead). We then see that, independent of the actual values of C^, so long as energy 
fluctuations in the diagonal ensemble 




(67) 


are sufficiently small (e.g., behaving like in traditional statistical mechanics ensembles), 
O will agree (to leading order) with the statistical mechanics prediction OmEj provided 

that Tr pmf,H = (?/;/|i7|V’/) = {E). This is because, using the ETH ansatz (62), one 


can rewrite Eqs. (65) and 


as 


O ~ 0{{E)) ~ Ome. 


( 68 ) 


Furthermore, given Eq. (62), one can quantify the difference between the two ensembles 


due to the fact that 5E is finite. Indeed, expanding the smooth function 0{E) into a 
Taylor series around the mean energy (E) 


Omm ~ 0{{E)) + {Em - {E}) ^ 


(E) 


1 

+ l^{Em-{E)) 


(E) 


(69) 


and substituting this expansion into Eq. (65), we find 


0^0{{E)) + -i5E)^0"{{E)) 


Ome + 2 [(dEf - {6 Eme?] 0"{{E)), (70) 


where SEme are the energy fluctuations of the microcanonical ensemble, which are subex- 
tensive. If the energy fluctuations 5E in the time-evolving system are subextensive, which 
is generically the case in systems described by local Hamiltonians (see, e.g., the discus¬ 
sion in Sec. 4.3.2), then the second term is a small subextensive correction to OmE; which 


is negligible for large system sizes. Moreover, the same Eq. (70) describes the difference 


between the equilibrium canonical and microcanonical expectation values of O if instead 
of 5E^ one uses energy fluctuations of the canonical ensemble. It is remarkable that, 
using ETH, one can show that O ~ Ome without the need of making any assumption 
about the distribution of Cm, beyond the fact that it is narrow. This is to be contrasted 
with the standard statistical mechanics statement about equivalence of ensembles, for 
which it is essential that the energy distributions are smooth functions of the energy 
Using the ETH ansatz, one can also calculate the long-time average of the temporal 


^^The energy distributions after quenches to nonintegrable Hamiltonians are expected to be smooth, see Sec. |3.4| 
However, because of ETH, thermalization in nonintegrable systems occurs independently of whether the energy 
distributions are smooth or not. 
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fluctuations of the expectation value of the observable O 

= lim r [ - (of 

to^oc to Jo 

= hm i ^dt V OrnnOpgC*^CnC;Cqe^^^--^-+^’^-^^^^ - {Of (71) 

to^oo to Jo 

= X] IC'mPiC'nPlOmnP < max|0mn,p ^ |C^P|C'„P = max|0mn,p OC exp[-S'(^)]. 

m,nj^m m,n 


Thus, the time fluctuations of the expectation value of the observable are exponentially 
small in the system size. These fluctuations should not be confused with the fluctu¬ 
ations of the observable that are actually measured in experiments, which are never 
exponentially small [23 EH]- Instead, Eq. tells us that at almost any point in time 
the expectation value of an observable 0 is the same as its diagonal ensemble expec¬ 
tation value. Thus the ETH ansatz implies ergodicity in the strong sense, that is, no 
time averaging is needed. In Sec. |6.8[ we show that ETH implies that temporal fluctu¬ 
ations of extensive observables satisfy standard fluctuation-dissipation relations. Let us 
point in passing that the results discussed so far are not restricted to pure states. They 
all straightforwardly generalize to mixed states by using the following substitutions in 
Eqs. (60), (65), and ([5^: —)• pmn and —)• pmm-, where pmn are the matrix 


elements of the initial density matrix in the basis of the eigenstates of the Hamiltonian. 

To contrast Eq. 0 with the fluctuations of 0 seen in experiments, let us also show 
the expression for the latter: 


(50^ = lim — 

to^OO tQ 


'dtm)\iO-Of\m)=T.\(^rn\\0^ 


-of 


(72) 


This quantity is nonzero even if the initial state is an eigenstate of the Hamiltonian 
ilJ’i) = 1^))) while (To is zero in that case. Assuming that 5E is sufficiently small, and 
using the ETH ansatz for 0^, we find 


(502 


50^ 


ME + 2 


0^"{{E)) - 20((E))0"((E)) [6E^ - (JEme)' 


(73) 


And we see that the fluctuations of O scale as the equilibrium statistical fluctuations of O. 
However, in this case, there is a second term which can be of the same order. We note that 
Eq. (73) describes the difference between the canonical and microcanonical fluctuations 
of O if instead of 5E‘^ one uses energy fluctuations of the canonical ensemble, that is, 
same order corrections to fluctuations also occur in equilibrium statistical mechanics. In 
generic cases, for example, for extensive observables in systems away from critical points. 


V60'^ jo ~ (50^e/Ome — 1 where V is the volume of the system. 

An important question that we leave unaddressed here is that of relaxation times. 
Namely, how long it takes for an observable to reach the diagonal ensemble result. The 
answer to this question depends on the observable, the initial state selected, and the 
specifics of the Hamiltonian driving the dynamics. As we will show when discussing 
results from numerical experiments, the relaxation times of observables of interest in 
lattice systems are not exponentially large. They actually need not even increase with 
increasing system size. 

Summarizing our discussion so far, we see that the language used to describe ther- 
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malization in isolated quantum systems is quite different from that in classical systems. 
Chaos, ergodicity, and thermalization are hidden in the nature of the Hamiltonian eigen¬ 
states. Relaxation of observables to their equilibrium values is nothing but the result 
of dephasing, as follows from the second term (in the last line) in Eq. (60). Thus, the 
information about the eventual thermal state is encoded in the system from the very 
beginning, the time evolution simply reveals it. In classical systems, one usually thinks 
of thermalization in very different terms using the language of particle collisions and 
energy redistribution between different degrees of freedom. It is important to realize that 
both approaches describe exactly the same processes. In Sec. 8^ we will briefly discuss 
how one can understand relaxation in weakly nonintegrable quantum systems through 
the language of quantum kinetic equations. Kinetic equations, when justihed, provide a 
unified framework to describe relaxation in both quantum and classical systems. 


^.2.2. ETH and the Quantum Ergodic Theorem 

Now that we have formulated ETH and seen its consequences for the dynamics of isolated 
quantum systems, let us come back to von Neumann’s ergodic theorem and discuss how 
it relates to ETH (or, more precisely, to RMT) [145| . As said before, von Neumann 
was interested in understanding what happens to observables during the unitary time 
evolution of all possible states drawn from the microcanonical shell. His theorem was 
then about the behavior of typical observables at most times. To state it, we follow the 
discussion by Goldstein et al. in Ref. m- For a recent generalization of this theorem, 
see Ref. |156j . 

von Neumann considered a Hamiltonian H with eigenstates \m) and eigenvalues 
Em, that is, H\m) = Emlm), and focused on a microcanonical energy window of 
width 5E around an energy E. This microcanonical energy window defines a Hilbert 
space of dimension T), which is spanned by T) energy eigenstates \m) with ener¬ 
gies Em € {E — 5E/2,E + 5E/2). For example, every state in the microcanonical en¬ 
ergy window can be decomposed as \^p) = where Cm = {mltp). The 

Hilbert space is then decomposed into mutually orthogonal subspaces of di¬ 
mensions du, such that = 0 ^^=^ and V = Yhpdy. Finally, the observables in 
are written as O = where A is the projector onto Here, both V 

and A are assumed to be large. By definition, the expectation value of the observ¬ 
able at time t is 0{t) = {'ipl exp[iE[t]0 exp[—iHt]\^jJ) while its microcanonical average is 
(0 )me = Neumann’s quantum ergodic theorem states that: In 

the absence of resonances in H, namely, if Em — En ^ E'^^ — E'^ unless m = m' and 
n = n', OT m = n and m' = n', and provided that, for any v, 


max 

m 


^(m| A|^) 



-|- m.ax\{m\Py\n)\‘^ is exponentially small, 

m^n 


(74) 


then 


\0{t) — (O)meP < e(0^)ME 


(75) 


for all but a fraction 5 of times t, where e and 6 are small numbers. It 
condition (74) guarantees that the eigenstate expectation value of O 


is easy to see that 
is identical to the 
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microcanonical prediction [US]. In fact: 


(m|0|m) 




E o. 


{m\P^\m) 

V 


{m\d\m) _ ^ 

2 _^ -^- = (<^)me 


(76) 


where the second equality 
we have used that 


holds up to exponentially small corrections, see Eq. (74), and 
{m\Pi,\m) = dp. Next, we have that 


(m|d|n) = Ou{'m\Pu\n), 

V 


(77) 


which is exponentially small if {m\Py\n) is exponentially small [as required in Eq. (74)] 
and if is not exponential in system size (as expected for physical observables). 

We then see that Eqs. (76) and (77) are nothing but the RMT predictions summarized 
in Eq. (20), or, equivalently, the ETH ansatz restricted to the Thouless energy window 
5E ~ Et = hD/L?‘, where the function f{E,oj) is approximately constant. Without this 


condition, Eq. 


cannot be satisfied. Ultimately, Eqs. (20), (62) and (74) rely on the 


fact that the overlap between the energy eigenstates and eigenstates of the observables is 
exponentially small [145] . It is important to note that RMT provides a wealth of infor¬ 
mation about the statistics of the level spacings and of the eigenstate components, which 
we have connected to quantum chaotic Hamiltonians, that was absent in von Neumann’s 
(much earlier) theorem. ETH goes beyond RMT (and the quantum ergodic theorem), as 
we mentioned before, because it addresses what happens outside the featureless Thouless 
energy shell. 


4.3. Numerical Experiments in Lattice Systems 
4 . 3 . 1 . Eigenstate Thermalization 

Numerical evidence of the occurrence of eigenstate thermalization has been found in a 
number of strongly correlated nonintegrable lattice models in fields ranging from con¬ 
densed matter to ultracold quantum gases. Such an evidence was first reported for a 
two-dimensional system of hard-core bosons |28| , and, since then, among others, it has 
been reported for a variety of models of hard-core bosons and interacting spin chains 
[7ni[9m fTM[T57HT64] . spinless and spinful fermions |126l 11581 116511166] , soft-core bosons 
|160L 11641116711169] , and the transverse field Ising model in two dimensions [170] . Below, 
we discuss the evidence for ETH separately for the diagonal and the off-diagonal matrix 
elements. 


4 .3.1.1. Diagonal matrix elements. We begin by illustrating the behavior of the diag¬ 
onal matrix elements of observables in the lattice hard-core boson model in Eq. (55), 
which transitions between the integrable limit and the chaotic regime as J' = V' departs 


from zero (see Sec. 3.4) 

In Eig. (d), we show in panels (a)-(c) the energy eigenstate expectation values of the 
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Figure 13. Eigenstate expectation values of the occupation of the zero momentum mode [(a)-(c)] and the kinetic 
energy per site [(d)—(f)] of hard-core bosons as a function of the energy per site of each eigenstate in the entire 
spectrum, that is, the results for all /c-sectors are included. We report results for three system sizes (L = 18, 21, 
and 24), a total number of particles N = L/3, and for two values of J' = V' [J' = V' = 0.16 in panels (b) and (e) 
and J' = V' = 0.64 in panels (c) and (f)] as one departs from the integrable point [J' = F' = 0 in panels (a) and 
(d)]. In all cases J = V = 1 (unit of energy). See also Ref. CSZ]- 


zero momentum mode occupation 


m{k) = ^ . 


(78) 


In panels (d)-(f), we show the kinetic energy per site 




-I ^ 

j=i 


-J ibybj^i+R.c.) - J' 


+ H.c. 


(79) 


Eigenstate expectation values are plotted as a function of the eigenenergies per site 
(Ep/L), for three different system sizes as one increases J' = V'. The qualitative behavior 
of mpp{k = 0) and Kpp vs Ep/L, depicted in Fig. (13), has been observed in other few- 
body observables and models studied in the literature, and, as such, is expected to 
be generic. The main features to be highlighted are: (i) At integrability, mpp{k = 0) 
and Kpp can have quite different expectation values [see, particularly, mpp{k = 0)] in 
eigenstates of the Hamiltonian with very close energies. Moreover, the spread does not 
change with increasing system size and the variance (not shown) decreases as a power 
law of the system size. Similar results have been obtained in other integrable models 
for larger system sizes than those available from direct full exact diagonalization of the 
Hamiltonian |1311ll33l 117111172j . (ii) As one departs from J' = V', or as one increases 
the system size for any given value oi J' = V' ^ the spread (or maximal differences) 
between the eigenstate expectation values in eigenstates with very close energies decrease. 
This is true provided that the eigenstates are not too close to the edges of the spectrum. 

Recently, Kim et al. m studied the eigenstate-to-eigenstate fluctuations Vp = 
Op_i_ip+i — Opp of both the x-component of the magnetization in a nonintegrable trans¬ 
verse Ising chain with a longitudinal field and of the nearest neighbor density-density 
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Figure 14. The first, second, fourth, and eighth largest values of Vp in the central half of the spectrum, as well as 
its mean value, are shown from top to bottom in both panels, (a) Results for the x-component of the magnetization 
in a nonintegrable transverse Ising Hamiltonian, (b) Results for the nearest neighbor density-density correlations 
in a nonintegrable hard-core boson Hamiltonian. From Ref. m\- 


correlations in the nonintegrable hard-core boson model (55). The resnlts for the average 
value of |rp|, and for some of the largest values of (in the central half of the spectrum), 
are shown in Fig. 14 as a function of the system size. They support the ETH expectation 
that eigenstate-to-eigenstate fluctuations decrease exponentially fast with increasing sys¬ 
tem size (similar results were obtained in Ref. |170j for the transverse field Ising model in 
two dimensions). Evidence that the variance of the eigenstate-to-eigenstate fluctuations 
of various observables decreases exponentially fast with increasing system size has also 
been presented in Refs. |159( I16n( I170j . 

The results discussed so far suggest that, away from the edges of the spectrum and 
for sufficiently large system sizes, any strength of an integrability breaking perturbation 
ensures that the first term in Eq. (62) describes the diagonal matrix elements of physical 
observables. By sufficiently large system sizes, we mean the same conditions that were 
discussed for the onset of quantum chaotic behavior in Sec. Systems that exhibit a 
many-body localization transition do not conform with this expectation m- 


4-3.1.2. Off-diagonal matrix elements. In Eig. |15t we show the matrix elements of the 
zero momentum mode occupation rh{k = 0) between the 100 eigenstates whose energy is 
closest to the energy of the canonical ensemble with temperatur^^ T = 3. Eigure[l^a) 
and [T^b) illustrates some of the most important properties of the off-diagonal matrix 
elements of few-body observables in integrable and nonintegrable systems. They have 
been discussed in Refs. [28l [TOl I158| for various lattice models in one and two dimensions 
and, recently, systematically studied in Refs. |154( 1159] 1164] . The first obvious property, 
seen in Eig. [T^a) and|15Kb), is that no matter whether the system is integrable or not, 
the average value of the off-diagonal matrix elements is much smaller than the average 
value of the diagonal ones. In the integrable regime, Eig. |15[a), a few off-diagonal matrix 
elements can be seen to be relatively large, while many are seen to be zero |154L 1164] . 
In the nonintegrable regime, Eig. [Hb), the (small) values of the off-diagonal matrix 
elements appear to have a more uniform distribution. Note that, in contrast to the 
integrable limit, no relatively large outliers can be identified among the off-diagonal 
matrix elements in the nonintegrable regime. In the latter regime, the values of the off- 
diagonal matrix elements have been shown to exhibit a nearly Gaussian distribution with 
zero mean [1591 |164j . and to be exponentiallv small in svstem size HMj. 

A better quantitative understanding of the behavior of the off-diagonal matrix elements 
of observables can be gained by plotting them as a function of Ep — Eq for a small window 


= 3 was selected so that the eigenstates considered are not too close to the ground state and do not correspond 
to infinite temperature either. These results are relevant to the quenches discussed in Sec. 14.3.21 
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Figure 15. OfF-diagonal matrix elements of m(k = 0) in the eigenstates of the Hamiltonian for a system with 
L = 24, N = -L/3, and J = V = 1 (unit of energy), (a) J' = V' = 0 and (b) J' = V' = 0.32. Results are shown for 
the matrix elements between the 100 eigenstates with energy closest to (a) E/L = —0.16 and (b) E/L = —0.19. 
Those energies were selected from canonical ensembles with T = 3 in both systems. See also Ref. [T58]. 


of values {Ep + Eq)/2. This is done in Fig. 16 for a one-dimensional model of hard-core 
bosons with the Hamiltonian |154j 


L-l 

h = -jJ2 




H.c. 


+ ''T.rf±h + oi: 


j<l 


\j-l\ 


X]hj. 


(80) 


The number of bosons was set to be L/3. The three terms in this Hamiltonian describe, 
from left to right, hopping (J = 1 sets the energy scale), dipolar interactions, and a 
harmonic potential [xj is the distance of site j from the center of the trap). We note that 


H in Eq. (80) is not translationally invariant so that the thermodynamic limit needs to 
]. For V = 0, this model is integrable (mappable to noninteracting 


be taken with care 
spinless fermions) irrespective of the value of J and g. 

In Fig. [T^a), we show results at integrability (V = 0 and <7 7 ^ 0), while, in Fig. [l6)(b), 
we show results away from integrability {V = 2 and g ^ Q). For both cases, results 
are reported for two observables, the site occupation at the center of the trap and the 
zero momentum mode occupation. The off-diagonal matrix elements of both observables 
are qualitatively different in the integrable and nonintegrable regimes. In the integrable 
model, there is a small fraction of large outliers among the matrix elements (whose 
absolute value is orders of magnitude larger than that of the median of the nonvanishing 
absolute values). In addition, there is a large fraction of matrix elements that vanish. As 
a matter of fact, one can see in Fig. [I^a) that only a few off-diagonal matrix elements of 
the site occupation are nonzero (this observable is the same for hard-core bosons and for 
the noninteracting fermions to which they can be mapped |33jl. For the zero momentum 
mode occupation (which is not the same for hard-core bosons and noninteracting fermions 
133]), the histogram of the differences between the absolute values and their running 
average [inset in Fig.|16|(a)] makes apparent that there is also a large fraction (increasing 
with system size |154j l of vanishing matrix elements. This demonstrates that, in the 
integrable model and for the observables shown, the off-diagonal matrix elements are 
not described by the ETH ansatz. In contrast, one does not find large outliers among 
the off-diagonal matrix elements in the nonintegrable model. In addition, the near flat 
histogram in the inset in Fig. [T6|(b) shows that there is no large fraction of them that 
vanish as in the integrable case. One can then conclude that the running average of 
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Figure 16. Absolute value of the off-diagonal matrix elements of the site occupation in the center of the system 
[h{x = 0)] and of the zero momentum mode occupation [rh{k = 0)] in the eigenenergy basis vs the eigenenergy 
difference Ep — Eq, for a small window of energies (Ep Eq)l2 (the center of the window was selected to be 
the energy of a canonical ensemble with T = 5). Results are shown for an integrable (a) and a nonintegrable (b) 
system with L = 18. Lines are running averages (|Opg|avg) for the matrix elements of 'm(k = 0). The insets show 
histograms of the relative differences between the matrix elements of m{k = 0) and the running averages. The 
relative difference is defined as (|Opg| — |Opg|avg)/|Opg|avg- The running averages were computed over 50 matrix 
elements for L = 15 and over 200 matrix elements for L = 18. Adapted from Ref. 


the off-diagonal matrix elements, that is, the absolute value of the function fo{E,uj) in 
Eq. (62), is a well-defined quantity in nonintegrable systems. 

This function is studied in detail in Fig. 17 for the occupation at the center of the 
trap (see figure caption for a precise definition of the observable). Results are reported 
for the same system described above but with L/2 bosons. By selecting a narrow energy 
window in the center of the spectrum, and by comparing results for two different system 
sizes (including the largest we are able to solve numerically), it is possible to identify 
three qualitatively different regimes at large, intermediate, and small energy separation 
u! = Ep — Eq [in what follows, we drop “E” from fo{E,uj), keeping in mind that E 
is that in the center of the spectrum]. These three regimes are shown in panels (a), 
(b), and (c). (a) For a; S> 1, the function |/o(w)| decays exponentially and the curves 
corresponding to different system sizes show an excellent collapse supporting the ETH 
ansatz ( [^ . (b) At intermediate u, |/o(a;)| is proportional to and, around the point 
marked with a vertical dashed line, one can see a broad peak whose position scales with 
L~^. (c) For a; <C 1, \fo{io)\ exhibits a plateau. Our results suggest that |/o(w)| in the 
plateau is proportional to and that its width is proportional to L~^. The results 

in panel (c) are noisier than in panels (a) and (b) because of poor statistics, which is 
the result of having only few pairs of eigenstates in the center of the spectrum such that 
00 = Ep — Eq 1. 

The three regimes identified above, for large, intermediate, and small values of oo, de¬ 
termine what happens to the observable at short, intermediate, and long times during the 
dynamics (c.f., Sec. 6.8). In the fast, high-frequency, regi me |/ o(a;)| is an exponentially 


decaying function independent of the system size. In Sec. 6.8 we show that |/o(a;)| 


|2 ; 


IS 


related to the spectral function of the observable O and to the dissipative part of the 
linear response susceptibility. Its exponential decay at high frequencies is expected on 
general grounds from perturbation theory, at least for systems with a bounded spec- 
trumj^ Such a high-frequency exponential tail was discussed, for example, in Ref. [173] 
for the conductivity, corresponding to the case where the observable O is the current op- 


^^When the energy spectrum is bounded, in order to absorb energy cj 1, many-body processes are required. 
These processes appear only in high orders of perturbation theory, which leads to an exponential suppression of 
|/o(w)| for a; > 1. 
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Figure 17. Plot of the function \fo{^)\ vs the eigenenergy difference u) = Ep — Eq for a system oi L12 bosons 
described by the Hamiltonian | |80| | with parameters J = U = 1 and g = 16/(Z/ — 1)^. The observable O is 
the occupation at the center of the trap (more precisely, the average of the occupation of the two central sites, 
as the calculations were done in the even sector when taking into account reflection symmetry and the center of 
the trap is in the middle of two sites) and the function \fo{^)\ is obtained in a small energy window centered 
around the middle of the spectrum, (a) At large uj, |/o(^)l decays exponentially, (b) At intermediate cj, \fo{^)\ 
is proportional to and has a broad peak whose position scales as L~^. (c) At small u, |/o(^)| exhibits a 

plateau. \fQ{uj)\ in the plateau is proportional to and the extension of the plateau is proportional to L~^. 


erator. (ii) At intermediate times, the independence of \fo{u})\ vs ujL on the system size 
indicates the existence of ballistic dynamics, (iii) At long times, the approximate collapse 
of \fo{u})\ vs loL‘^ for different system sizes indicates diffusive dynamics. Remarkably, at 
frequencies smaller than a characteristic frequency Uc ~ 1/L^ (corresponding to times 
longer than the diffusive time tc ~ A^), the function \fo{u})\ saturates at a constant value 
proportional to It is in this regime that the ETH ansatz (62) becomes equivalent 
to the RMT ansatz As the diffusive time is the longest relaxation time scale in the 


system, one expects that in this regime physical observables do not evolve. The fact that 
|/o(a;)| at the plateau is proportional to A^/^ can be understood as follow. The function 
|/o(w)p is related to the nonequal-time correlation function of the observable O, see 
Sec. |6.8[ In particular, when evaluated at w = 0, we have 


|/o(a; = 0)poc / dt {d{t)d{0) + d{0)d{t))c (x 


dt 

oc V A oc A, (81j 
Vt 


where we have used that the diffusive time scale A sets an upper bound for the time 
integral, and that, assuming diffusive behavior, the nonequal-time correlation function 
of O is expected to be oc ^ then follows that /o(<^ = 0) oc \/A. Remarkably, 

our results suggests that the scaling of fo{oj) with A^/^ is also valid at the intermediate 
frequencies that are relevant to ballistic transport [see Fig. [I7|(b)]. 


4-3.2. Quantum Quenches and Thermalization in Lattice Systems 


Now, let us see what happens when systems such as those studied in Sec. 4.3.1 


are 


taken out of equilibrium. Among the most common protocols for taking systems out 
of equilibrium are the so-called sudden quenches or, simply, quenches. As explained 
in Sec. |3.4[ in a quench the system is assumed to be initially in equilibrium and then 
suddenly some parameter(s) is (are) changed. The dynamics proceeds without any further 
changes of parameters. For example, in ultracold gases experiments in optical lattices, 
one can suddenly change the depth of the optical lattice [SHazmi], displace the center 
of the trapping potential |174H176j . or turn off a trapping potential while keeping the 
optical lattice on |l3l I177( 1178] . Theoretically, one can think of a quench as a protocol 
in which one starts with a stationary state of a given Hamiltonian, often the ground 
state, and then suddenly changes some Hamiltonian parameter(s). The initial state is 


^®It is likely that ~ /D, where D is the diffusion constant. 
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not stationary in the new (time-independent) Hamiltonian, as a result of which it has a 
nontrivial unitary dynamics. Quenches in which one changes parameters throughout the 
system are called global quenches, while quenches in which parameters are only changed 
in a finite region are called local quenches. In the former class of quenches, one generally 
adds an extensive amount of the energy to the system, while, in the latter class, the 
change in energy is subextensive. 

A remarkable property of quantum quenches involving local Hamiltonians is that one 
can actually prove, under very general conditions, that the width 6 E of the energy 
distribution after a quench scales with the square root of the volume (or of the number 
of particles) |28j . This behavior is expected from thermodynamics, and is essentially a 
consequence of the central limit theorem. This width sets the effective “microcanonical 
window” of the equivalent thermodynamic ensemble. It depends on the details of the 
initial state and the quench protocol. 

To prove that after a global quench 6 E w, we consider, for concreteness, a lattice 
system prepared in an initial state \'ijji) which is an eigenstate (not necessarily the ground 
state) of the initial Hamiltonian Ho. After the quench, the Hamiltonian is H = Hq + Hi, 
where Hi is a sum of local operators Hi = hj. One can then write |28) 


6 E = {i;i\H^\iji) - {i;i\H\^i)^ = yj{i;i\Hf\i;i) - {^l;i\Hi\i;i)^ 




(82) 


Jl,j2 


From the expression above, one concludes that, in the absence of long-range connected 
correlations between hj in the initial state, and if all matrix elements are finitethe 
width 6 E scales at most as the square root of the number of lattice sites in the system, 
that is, dE ~ \/V■ Because the energy itself is extensive in the volume of the system, we 
see that the relative energy fluctuations are inversely proportional to the square root of 
the volume 6 E/E ~ 1/\/H as expected from equilibrium thermodynamics. This result is 
a consequence of the locality of the Hamiltonian and, hence, is unrelated to whether the 
system is integrable or nonintegrable. This scaling of energy fluctuations, in combination 
with eigenstate thermalization, ensures that in generic systems with local interactions 
thermalization occurs after a quench. Generalizing this proof to continuous systems is 
straightforward. 

Next, we address two important questions whose precise answer depends on the 
specifics of the system and of the observable of interest, but whose qualitative answer 
has been found to be quite similar for several strongly correlated lattice models and 
observables studied. The first question is how long it takes for experimentally relevant 
observables to relax to the diagonal ensemble predictions. The second one is how large 
the system sizes need to be for the relative difference between the diagonal ensemble 
and the statistical mechanics predictions to be small. These questions have been mainly 
addressed in numerical experiments. We reproduce some results of these numerical ex¬ 
periments below. 


4-3.2.1. Dynamics. We consider the dynamics of observables in the hard-core boson 
model (55). Some numerical results for this model were already discussed in Secs. 3.4 


and 4.3.1 For the quench dynamics discussed here, the initial states are taken to be 


eigenstates of the Hamiltonian with Jj = 0.5, Vj = 2, J' = V', and the time evolutions 


^®This is guaranteed if the operator norm of each hj is finite 
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Figure 18. (a)—(c) Relative difference between the instantaneous momentum distribution function and the diag¬ 

onal ensemble prediction (see text) as a function of time, (d)—(f) Relative difference between the instantaneous 
kinetic energy and the diagonal ensemble prediction (see text) as a function of time. The strength of the inte- 
grability breaking terms [J' = V') increases from top to bottom: (a),(d) J' = V' = 0 (integrable point); (b),(e) 
J' = V' = 0.16; and (c),(f) J' = V' = 0.64. Results are reported for two system sizes (L = 21 and 24) and 
N = L/3. In all cases, J = V = 1 (unit of energy). Time is given in units of h/J. See also Ref. usa. 


are studied under final Hamiltonians with J = V = 1 (unit of energy), and J' = V 
m\- Hence, only the nearest neighbor parameters are changed during the quench. The 
strengths {J' = V') of the integrability breaking terms remain unchanged. To characterize 
the dynamics of the entire momentum distribution function and of the kinetic energy 
(by comparing them to the diagonal ensemble results), the following relative differences 
are computed 


6 m{t) = 


Y,k\'m{k,t) - mDE(fc)| 
Efe”^DE(A:) 


and 6 K{t) = 


\K{t) - K-[)e\ 

AdeI 


(83) 


respectively. In these expressions, t refers to time and the subscript “DE” refers to the 


diagonal ensemble prediction [recall Eq. (53)]. In order to be able to compare results 
for systems with different Hamiltonian parameters in a meaningful way, the initial state 
for each quench is selected to be the eigenstate of the initial Hamiltonian that, after 
the quench, has the closest energy to that of a system with temperature T, namely, 
{i/jilHlipj) = Tr[Hexp{—H /r)]/Tr[exp(—H /T)], where the Boltzmann constant is set to 


unity. For the quenches discussed in what follows, T = 3 as in Fig. 15 This temperature 
is such that eigenstate thermalization can be seen in these small systems and 0{E) is 
not featureless as expected in the center of the spectrum (i.e., at “infinite temperature”). 

In Fig. |18Ka)-|18Kc), we show results for 6 m{t) vs t for systems with L = 21 (blue lines) 
and L = 24 (red lines), and for three values of J' = V. The behavior of 5m{t) vs t is qual¬ 
itatively similar for all values of J’ = V. Namely, at t = 0, one can see that 5m is large 
(> 10%, except for J' = V' = 0.16) and then it quickly decreases (in a time scale of the 
order of h/J) and starts oscillating about a small nonzero value (~ 2% for L = 24). With 
increasing system size, the value about which 5m{t) oscillates, as well as the amplitude 
of the oscillations, decrease. A qualitatively similar behavior, though with a significantly 
smaller mean and amplitude of the oscillations about the mean, can be seen during the 
time evolution of 6 K{t) [Fig. [T^d)-fiRf)]. Comparable results have been obtained for 
other nonintegrable models and observables [28l [^ 1154111571115811169L I179H185| . All that 
numerical evidence makes clear that, despite the exponentially small (in system size) level 
spacing in many-body quantum systems, the relaxation of physically relevant observables 
to the diagonal ensemble results does not take exponentially long time. Furthermore, in 
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Figure 19. (Main panels) Relative difference between the predictions of the diagonal and microcanonical ensembles 
for the momentum distribution function (see text) as a function of the strength of the integrability breaking terms. 
(Insets) The same as the main panels but for the kinetic energy. Results are reported for hard-core bosons (a) and 
for spinless fermions (b), and for two system sizes L = 21 (blue lines) and L = 24 (red lines) with N = L/3. In all 
cases, J = V = 1 (unit of energy). See also Refs. |157l 115^ . 


accordance with our expectations based on the ETH ansatz, numerical experiments have 
also shown that the scaling of the variance of the time fluctuations of expectation val¬ 
ues of observables is consistent with an exponential decrease with increasing system size 


mi- 

Note that the results reported in Fig. 18 were obtained in systems in which there are 
only seven and eight hard-core bosons, for L = 21 and L = 24, respectively. Namely, 
the time fluctuations of the expectation values of observables can be very small even for 
systems with a very small number of particles (see Ref. |28j for an analysis of a two- 
dimensional system with only five hard-core bosons that exhibits a qualitatively similar 
behavior). 


4-3.2.2. Post relaxation. After showing that even small finite systems relax to the pre¬ 
dictions of the diagonal ensemble and remain close to them, we need to check how close 
the diagonal ensemble predictions are to those made by standard statistical mechanics. 
This is the final step needed to know whether thermalization takes place. Since we are 
dealing with small systems, which ensemble is taken among the microcanonical, canoni¬ 
cal, and grand canonical ensembles makes a difference. Considering that the systems of 
interest here are isolated, the most appropriate statistical ensemble is the microcanonical 
ensemble |28[I157| . Therefore, we compute the following relative differences to character¬ 
ize whether the system thermalizes or not 

A _ — rnME{k)\ , a iv _ Ade — ATme| 

ZkmBEik) ’ IT^deI 



In these expressions, the subscripts “DE” and “ME” refer to the diagonal and micro- 
canonical ensemble predictions, respectively. 

The main panel (inset) in Fig. [iRa) depicts results for Am (AK) in the hard-core 


boson systems whose dynamics was reported in Fig. 18 The corresponding results when 
the hard-core bosons are replaced by spinless fermions are shown in Fig. [iRb). One 
can see that the behavior of Am (AK) is qualitatively similar for hard-core bosons and 
spinless fermions. The largest differences between the predictions of the diagonal and 
microcanonical ensembles are seen at (and close to) the integrable point. As one departs 
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from the integrable point, the differences decrease. After a fast decrease, there is an 
interval of values of J' = V' at which Am (AK) becomes almost independent of the 
exact value of J' = V (up to finite-size fluctuations). In that interval. Am (AK) can 
be seen to decrease as one increases system size (up to finite-size fluctuations). This is 
consistent with the expectation that those differences vanish in the thermodynamic limit. 
Numerical evidence that, in the thermodynamic limit, the predictions of the diagonal 
ensemble for observables after a quench to a nonintegrable model are identical to those 
from traditional statistical mechanics ensembles has been obtained in numerical linked 
cluster expansion studies |130[ll35j . On the other hand, in quenches to integrable points in 
the thermodynamic limit, it was found that lack of thermalization is ubiquitous |1301I135] 
(see Sec. 8.3). For finite systems, it is striking that the differences between the diagonal 
and the microcanonical ensembles can be a fraction of a percent even for systems with 
less than 10 particles (which makes their experimental detection unlikely). Similar results 
have been obtained for other observables in Refs. [28l [99l 1154111571115811169] . 

The results presented in this section support the expectation that nonintegrable quan¬ 
tum systems exhibit eigenstate thermalization and therefore thermalize in the strong 
sense as defined in this review. 


5. Quantum Chaos and the Laws of Thermodynamics 

If one assumes that a system is prepared in a Gibbs, or other equivalent ensemble, then 
one does not need assumptions about chaos and ergodicity to prove various statements 
of statistical mechanics and thermodynamics. For example, the fluctuation-dissipation 
relation can be straightforwardly proved using standard perturbation theory. Quantum 
chaos and ETH allow one to prove all the statements for individual eigenstates of chaotic 
Hamiltonians, and therefore for arbitrary stationary ensembles (with subextensive energy 
fluctuations). This distinction is at the heart of the importance of quantum chaos for the 
proper understanding of thermodynamics in isolated systems. In the earlier sections, we 
argued that eigenstate thermalization is generally needed for isolated quantum systems 
taken far from equilibrium to thermalize. Likewise, in the following sections, we will 
show that the same assumptions of quantum chaos together with ETH are sufficient for 
establishing thermodynamic relations in such isolated systems. 


5.1. General Setup and Doubly Stochastic Evolution 

Equilibrium thermodynamics studies transformations between equilibrium states of 
macroscopic systems. During such a transformation, thermodynamic quantities (such 
as the free energy, magnetization, and pressure) evolve in time. These changes are usu¬ 
ally induced by either heat exchange with another macroscopic system or the work done 
on the system via changing some macroscopic parameters in time (like its volume or the 
applied magnetic field), or both. For example, consider a phase transformation from a 
solid to a liquid as temperature is changed, or the exchange of energy, in the form of 
heat and work, in heat engines. The laws of thermodynamics dictate which process are 
possible and which are not. They give bounds for engine efficiencies and provide rela¬ 
tions between superficially different quantities (e.g., the Onsager relations, which will be 
discussed in the next section). 

Since equilibrium thermodynamics provides relations between different equilibrium 
states, the concept of a quasi-static process is central to the development of the theory. A 
quasi-static process is one in which the state of the system is changed very slowly through 
a sequence of equilibrium states. However, it is important to stress that thermodynamics 
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Figure 20. Schematic representation of a gas undergoing a compression 


is not limited to quasi-static processes. For example, in one formulation of the second 
law of thermodynamics one considers an equilibrated isolated system that undergoes a 
dynamical process (which need not be quasi-static). As a result, the entropy difference 
between the final equilibrium state of the system and the initial equilibrium state is 
positive or zero independent of how rapidly the process is carried out. Moreover, this 
entropy difference is uniquely determined by the total energy change in the system, no 
matter how fast or slow the process of energy exchange is. Another remarkable example 
of thermodynamic relations are the recently discovered fluctuation theorems |20[ [25l 11861 
EHTj , which make exact statements about work, heat, and free energy changes in arbitrary 
nonequilibrium processes. 

To derive thermodynamic relations one needs to consider dynamical processes that 
start from a stationary state. To this end, we focus on an isolated system initially prepared 
in a stationary state, which undergoes a unitary evolution in response to an external 
change. The latter is modeled by a change in time of macroscopic parameters in the 
Hamiltonian according to a prescribed protocol. The protocol considered is such that 
the parameters are changed during a finite time, after which the Hamiltonian is time 
independent and the system is allowed to relax to equilibrium. As an example, one 
can think of a gas confined in a container by a piston (see Fig. 20). In this case, the 


macroscopic parameter is the position of the piston, which is changed in time from 
position z = A io z = B according to a protocol z{t). At the end of the process, the 
piston is kept hxed at position z = B and the gas is allowed to equilibrate. While we 
focus on isolated systems, our setup can also describe open systems. This is because, if 
the dynamical process involves changing parameters only in a part of the system (a local 
operation), the rest of the system can act as a thermal bath. If the bath is much larger 
than the part of the system in which the dynamical process is implemented, the intensive 
quantities characterizing the latter (such as the energy density defining the temperature 
or the pressure) after it relaxes to equilibrium will be identical to their initial values. 

Mathematically, the assumption that the systems start in a stationary (i.e., a state 
that is translationally invariant in time) amounts to taking the initial density matrix 
to be diagonal in the basis of the initial Hamiltonian (if there are degeneracies in the 
spectrum, one just needs to find a basis where the density matrix is diagonal) 


„(0) _ (0) r 

Hnm ~ Hnn ^nm- 


(85) 


It is important to note that, at the moment, we make no assumptions on the structure of 
the density matrix beyond stationarity. After the system undergoes a dynamical process. 
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the density matrix in the basis of the final Hamiltonian, that is, the Hamiltonian at 
the end of the dynamical process, is not diagonal anymore. As discussed previously, 
the off-diagonal matrix elements dephase and, at long times, the expectation values of 
observables are solely determined by the diagonal matrix elements. The latter are given 
by 


Pmm — UfnnPnnUji^, 
n 

where Unm are the matrix elements of the evolution operator 


( 86 ) 


JJ = Tt exp 


—i 




) Uifin — (?7 i|H| 77.). 


(87) 


Here, T* denotes time ordering, and the “tilde” indicates that the states \m) are eigen¬ 
states of the Hamiltonian after the evolution, while |n) are the eigenstates of the initial 
Hamiltonian. The two basis sets coincide only in the special case of cyclic processes. 

Equation (86) can be rewritten as a master equation for the occupation probabilities 
of the microstates 


.(1) 


nfO) (u- [/t ^ nfO) r) , - 

/ , Pnn y-'rnn^nfh I — Pn^m 


( 88 ) 


where we have defined the transition probabilities between states \n) and \rh) associated 
with the dynamical process to bt^ 


Pn^ih = U^nUl^ = \Ufanf ■ (89) 

The last equality trivially follows from the identity = (Umn)*, where the star indi¬ 
cates complex conjugation. 

From the unitarity of the evolution operator it follows that 

= Snk, and Uj^^Umn = ' (^b) 

m n 

Setting k = n and A: = m, we immediately see that 

^ ^ Pn^rh — 1 ) ^ ^ Pn^m — 1 * ( 91 ) 

m n 

These conditions allow for a simple physical interpretation if we rewrite it changing the 
dummy indices n and m in the second sum, namely 

^ ^ Pn^rh — ^ ^ Pm^fi' (92) 

m^h m^n 

In other words, the sum of incoming probabilities to any given state \h) of the final 
Hamiltonian is equal to the sum of the outgoing probabilities from an equivalent, for 
example, adiabatically connected state \n) of the initial Hamiltonian. For a cyclic process. 


^^Note that the transition probabilities are conditional, i.e., they define the probability of a transition from state 
\n) to state |m) if the system is initially prepared in state |n). 
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one can remove the tildes and simply say that the sum of incoming probabilities to any 
eigenstate is equal to the sum of outgoing probabilities from the same state. 

The transition probabilities Pn^m = \Unifi\^ are positive semi-dehnite for any pair of 
states. This, combined with the constraints above, leads to 


0 ^ Pn^fh ^ 1 


(93) 


as expected. Any semi-positive matrix p satisfying the constraints (91) and (93) is called 
doubly stochastic |188j . The corresponding evolution described by the master equa¬ 


tion (88) is called a doubly stochastic evolution. 


As it has been known for a long time, the constraints (91) have far-reaching conse 


quences and, for example, play a prominent role in the formulation of the kinetic theory 
of gases |189j . Moreover, doubly stochastic evolution is the proper framework to discuss 
thermodynamic processes in isolated quantum systems since it emerges naturally based 
on the assumptions that: (i) the system starts from a stationary state, (ii) the system 
evolves unitarily, and (hi) the long-time behavior of the observables is determined only 
by the diagonal elements of the density matrix in the basis of the hnal Hamiltonian. 
With this in mind, we review some general properties of the master equation, and those 
associated with doubly stochastic matrices in particular. 


5.1.1. Properties of Master Equations and Doubly Stochastic Evolution 


Equation (88) is a discrete-time master equation. The matrix p, with elements Pn,m = 
Pn^m satisfying the first of the two conditions in Eq. (91), is known as a Markov matrix 


or, equivalently, as a stochastic matrix. The action of p on a probability vector, in our 
case (with elements pnn), gives a new probability vector p^^'> (with elements p^^), 
which is the result of stochastic transitions between the different states of the system. 
For completeness, we briefly review some of the properties of a Markov matrix that will 
be used in our discussion. More details and complete proofs can be found, for example, 
in Ref. |190j . 

We first note that the conservation of probability implies that the outgoing transition 
probabilities from any state must sum to one [first condition in Eq. (91)], so that the 
sum over each column of the Markov matrix is 1. This holds for any master equation. 
Indeed, from Eq. 


EAk = EE Pnn Pn^fh — T.f>nl = r (94) 

fh fh n n 


In general, the matrix p is not symmetric and therefore admits separate left and right 
eigenvectors (the spectrum associated with left and right eigenvectors is the same). If 
one applies the matrix p many times on a probability vector one expects that the prob¬ 
ability distribution relaxes to a steady state. This implies that the matrix p has one 
eigenvalue Aq = 1 whose corresponding right eigenvector is the steady-state probability 
distribution |190j . The existence of the eigenvalue Aq = 1 is straightforward to prove 
as the left vector (1,1,1...) is always, by conservation of probability, the corresponding 
left eigenvector. One can show that if the Markov matrix does not have a block diago¬ 
nal form, which implies that some states cannot be reached from others, then the right 
eigenvector corresponding to the eigenvalue 1 is unique [190] . Finally, we note that the 
relaxation to the steady-state following many applications of p is dictated by the other 
eigenvectors and their corresponding eigenvalues. One then expects that the eigenvalues 
Aj satisfy |Ai| < 1 (this can be proved rigorously |19r)j ). 

Next, we turn to the second condition in Eq. (91), which is associated with the doubly 
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Figure 21. Schematic representation of a doubly stochastic evolution for a cyclic process. An isolated system 
undergoes some dynamical process where the control parameter A changes in time in some arbitrary way (top). 
The cumulative effect of the evolution is given by the transition probabilities (bottom). Red arrows indicate the 
outgoing transition probabilities from the level |2) of the initial Hamiltonian to levels |1) and |3) of the final 
Hamiltonian (which in this case are the same as |1) and |3) because we consider a cyclic process). Black arrows 
describe the incoming transition probabilities to the level |2) of the final Hamiltonian: |1} |2) and |3) |2}. 

Doubly stochastic evolution implies that P 2 i +P 23 ~ P 12 P 32 ‘ 


stochastic nature of p. It states that the incoming transition probabilities to any state 
sum to one. This constraint is less trivial and does not generally hold for non-unitary 
evolutiond* At the same time doubly stochastic evolution is more general than uni¬ 


tary evolution. In particular, the product of two doubly stochastic matrices is a doubly 
stochastic matrix [see Eq. (95)]. This implies that any projective measurement performed 
during the evolution, which breaks unitarity, keeps the transition matrix doubly stochas¬ 
tic. Moreover, any statistical mixture of doubly stochastic matrices is doubly stochastic. 
This implies that if, for example, one repeats slightly different dynamical protocols start¬ 
ing with the same density matrix and ending with the same final Hamiltonian, then the 
transition matrix describing the average effect of these dynamical protocols is still doubly 
stochastic. Because of this, various dephasing mechanisms (e.g., the presence of external 
noise or fluctuating waiting times between different pulses) keep the evolution doubly 
stochastic, even if they generally break its unitarity. The second condition in Eq. (91) is 
a direct consequence of the fact that any doubly stochastic matrix can be represented as 
Pn^m = |(m|H|n)p for some (maybe fictitious) unitary operator U, see Ref. [191j . For a 
unitary process, one can always define its inverse. Therefore, the role of the initial and 
final states is interchangeable, and the same sum rule applies to both summations over 
n and m. The doubly stochastic condition is schematically illustrated in Fig. where 
it is shown that the sum of the outgoing rates from a state |2) (red lines) is equal to the 
sum of the incoming probabilities into the state |2) (black lines). 

The easiest way to satisfy the doubly stochastic constraint is to have identical transi¬ 
tion probabilities between any two pair of energy levels, Pn-s-m = Pm^n- This condition 
is known as detailed balance for an isolated system]^ Detailed balance is satisfied in: 
(i) two level systems, (ii) systems with more than two energy levels within first order of 
perturbation theory (e.g., a Fermi golden rule) |192) . and (hi) systems with real Hamilto¬ 
nians, which satisfy instantaneous time-reversal symmetry, subjected to symmetric cyclic 
protocols such that H{t) = H{T — t), where T indicates the total duration of the dy- 


^®For example, if the Markov process admits an absorbing state, this condition is violated since the sum of the 
incoming transition probabilities into the absorbing state is larger than one. 

^^For systems in contact with a thermal bath at temperature T, the detaile d ba lance condition is Pn^fh = 
exp [^{En — ^m)], where is the inverse temperature. In Sec |6.2| we show how this condition 

follows from the doubly stochastic transition rates. 
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namical process |193| . In general, however, pairwise transition probabilities are not the 
same, that is, the detailed balance condition pn-s-m = Pm^n is violated and only the sum 


rules (91) are satished. 


In passing, we note that doubly stochastic matrices are intimately related to permuta¬ 
tion matrices, as stated by Birkhoff’s theorem, which is presented in Appendix [D| This 
theorem allows one to make physical predictions for arbitrary doubly stochastic evolution 
of systems with many degrees of freedom. For example, it allows one to rigorously bound 
the maximum amount of work that can be extracted from a microcanonical bath |194j . 

Repeated processes. Next, we show that doubly stochastic matrices form a group under 
multiplication, that is, the product of two doubly stochastic matrices p and g is a doubly 
stochastic matrix s (the remaining group properties follow trivially): 


^ 


Vr,- 






(95) 


where, to simplicity the notation, we dropped the tilde over the state labels. Physically, 
Eq. (95) tells us that performing a sequence of two (or more) doubly stochastic processes 
on a system is again a doubly stochastic process. This property allows one to split any 
doubly stochastic process in a sequence of arbitrary many doubly stochastic processes. 
We now apply this result to a concrete setup in which an initially stationary density 


matrix pnn undergoes an arbitrary dynamical process that is interrupted by a sequence 
of ideal projective measurements, i.e., we consider the sequence: 


P 


( 0 ) 



t/l c/2 t/iV 


(96) 


where Uj represents an arbitrary dynamical process and Mj an arbitrary projective 
measurement. Immediately after each projective measurement, the density matrix is 
diagonal in the basis of the projection operator Mj. Hence the previous sequence is 
equivalent to: 


pio) 

rnn > 




( 2 ) 

,Pkk 


p{N) 

/ rmm 


(97) 


C/i 


C/iV 


20 


Each fun- 


where is a diagonal matrix in the basis of the projection operator Mj 
damental block is a doubly stochastic process and is represented by a proper doubly 
stochastic matrix. Eor example: 


. M) ^ Jj) _ „0-i) „(i) /Qo\ 

pii ^Pkk ^ pkk-2^pii Pi^k- 

u, I 

Using the group property discussed above, the sequence is described by a single doubly 
stochastic matrix s obtained as the (matrix) product of the doubly stochastic matrices 
of the fundamental processes: 

Pmm = Pnn S = , (99) 


simplify the presentation, here we denote all basis states using the same notation. One needs to keep in mind 
that the initial state is always diagonal in the basis of the initial (measurement) Hamiltonian and the final state 
is always diagonal in the basis of the final (measurement) Hamiltonian. 
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where the bold symbols indicate matrices. 

One can generally think of projective measurements as quenches to a measurement 
Hamiltonian and dephasing. For example, let us imagine we have a system of (possibly 
interacting) spins and we want to do a projective measurement of the z-magnetization 
of a given spin. Formally, we can simply say that we project this spin to the z-axis 
and read probabilities. Alternatively, one can think about the same process as a quench 
to a very strong local magnetic field along the z-axis, such that the remaining part 
of the Hamiltonian does not matter, and then dephasing (or time averaging) of the 
density matrix. Thus, this combination of quench and dephasing projects the local spin 
density matrix into a statistical mixture of “up” and “down” states. For this reason, the 
factorization property of the transition matrix (99) holds if we have a series of quenches 
with long random waiting times in between. These random waiting times are equivalent 
to the projection of the density matrix to the basis of the Hamiltonian after each quench 
or, formally equivalent, to the projective measurement of the energy of this intermediate 
Hamiltonian. 

In ergodic systems, random waiting times are not needed, it is sufficient to wait times 
that are longer than the relevant relaxation time. As we discussed in the previous section, 
apart from small fluctuations, from the point of view of observables the density matrix 
is effectively dephased (see Ref. [195] for a more formal discussion of this point, and 
Ref. |196j for caveats). If the waiting time between quenches (random or not) is shorter 
than the relaxation time, then the transition matrix describing the whole dynamical pro¬ 
cess is doubly stochastic but it is not the product of the transition matrices corresponding 
to the individual quenches. For example, if one considers a large periodically driven er¬ 
godic system, one can anticipate that, if the driving period is longer than the relaxation 
time, the exact periodicity of the driving protocol is not important and the transition 
probability factorizes (in small ergodic systems the factorization can be violated even for 
long driving periods |196| ). If the period is short compared to the relaxation time, one 
has to use the Floquet formalism (see, e.g., Ref. |197j for review) to accurately describe 
the time evolution after many periods. In this case, the dynamics between periods is 
coherent and the factorization property of the transition probability (99) does not apply. 
Still, the overall evolution remains doubly stochastic. 

Let us now discuss the implications of doubly stochastic evolution for time-reversed 
processes. The two conditions (91) imply that one can define the transpose transition 
rate matrix = Pn^m, which is also a doubly stochastic matrix that corresponds to 

a reversed process in which the role of initial and final states is swapped. For a unitary 
process (i.e., a process without projective measurements or dephasing), the time-reversal 
process corresponds to the transition matrix p^, that is, p*''"' = p^. Indeed, for the 
time-reversal process, the evolution operator is given by U~^ = 1)"^. Therefore 


Pfh^n — I ^lifh I ~ I \ — Pn- 


( 100 ) 


In practice, time-reversal processes are very difficult to realize. They require either an 
overall change of sign of the Hamiltonian or taking the complex conjugate of the wave 
function, which in the classical language is equivalent to changing the sign of the velocities 
of all particles. As noted in Ref. [25], the dynamics leading to the transition probabilities 
p)h^n be achieved much more easily by using the so-called reversed protocol. To see 
the difference between the time-reversal and reversed processes, consider again a unitary 
evolution. The evolution operator and its time inverse are given by the time-ordered 
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exponentials (such that later times t appear on the left): 


U = Tt exp 


—i / H{t)dt 


= Tt 


t exp 


i / H{T-t)dt 


( 101 ) 


L ./o J L ^0 J 

Let us now define the reverse protocol as the forward time evolution with the Hamiltonian 
H(T — t), that is, the Hamiltonian for which we simply reverse the dynamical protocol. 
The corresponding evolution operator is given by 




Ttexp 


—i 


H{T - t)dt 


( 102 ) 


Note that and W are very different. For example, UW is the identity while is 
not. Nevertheless, if the Hamiltonian H[t) is real at each moment of time, that is, satis¬ 
fies instantaneous time-reversal symmetry, then U'^ and lead to the same transition 
probabilities. Using this fact, the eigenstate |n) and \rh) can be chosen to be real in that 
case, we hnd 


{n\tj^\m)* = {n*\ \rh*) = {n\U^\m). (103) 

Therefore 

P^rh^n = = I (^1 l^> I^ (104) 

Unlike the time-reversal process, which generally exists only for unitary evolution, the 
reverse process is defined even if the forward protocol is not unitary. For example, if 
it involves projection measurements along the way. As we discussed, in this case the 
transition probability matrices factorize into products of transition probability matri¬ 
ces corresponding to processes between measurements. It is then straightforward to see 
that for the reversed process, which involves exactly the same sequence of measurement 
performed in the opposite order, one still has = Pn^m- If the protocol is time 

symmetric, that is, H{T — t) = H{t), then U = and hence = pm^n- Combining 

this condition with p^^n — Pn^m, we see that for such symmetric protocols, detailed 
balance is automatically satisfied, i.e., Pn^m = Pm^n- 


5.2. General Implications of Doubly Stochastic Evolution 

We now derive the physical implications of doubly stochastic evolution. The results in this 
subsection rely only on doubly stochasticity and are therefore valid for both nonintegrable 
and integrable systems. 


5.2.1. The Infinite Temperature State as an Attractor of Doubly-Stochastic Evolution 

First, let us consider a cyclic process. In this case, the basis rfi and m are identical and 
the master equation (88) becomes: 


P 


( 1 ) 


= PP 


( 0 ) 


(105) 
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where on the RHS we have a matrix vector multiplication. If we repeat the process N 
times, we obtain 


p(^)=p^p(o). (106) 

On physical grounds one expects that, after many applications of a dynamical process, 
all eigenstates of the Hamiltonian should have an equal occupation: 

Jim Pml = const. = (107) 

N^oo U 

where T) is the dimensionality of the Hilbert space. The state characterized by pmm = 
const, is often called an “infinite temperature state” since it is formally identical to a 
Gibbs distribution, pnn = in the limit /3 —)> 0. The invariance of the infinite 

temperature state under doubly stochastic evolution trivially follows from the master 
equation. By substituting the infinite temperature state (which is the right eigenvector 
of p corresponding to Aq = 1) in the master equation, we obtain: 

= '^Pn^mpin^ = ^ ^ Pn^m = ^ = (^O^) 

n n 

In Appendixwe prove that the infinite temperature state is an attractor of the doubly 
stochastic evolution. The approach to the steady state is controlled, as discussed in Sec. 
5.1. It by the eigenvalues of p whose absolute value is smaller than one. 

Let us discuss in detail the three-level system depicted in Fig. Besides providing 
a concrete example of the approach to infinite temperature, this example clarifies under 
which conditions the system always relaxes to the inhnite temperature state. Instead 
of considering the most general doubly stochastic evolution (which is discussed in ap- 
pendix[D]in connection with Birkhoff’s theorem), we assume that: (i) the process is cyclic 
(therefore we can drop tilde signs over eigenstate labels of the final Hamiltonian), (ii) 
the transition probabilities satisfy the detailed balance condition pn^m = Pm^n, and 
(hi) the only nonzero transition probabilities are between states 1 and 2 {pi2 = 712) and 
states 2 and 3 {p23 = 723)- From probability conservation, we must have pu = 1 — 712, 
P22 = 1 - 712 - 723 and P33 = 1 - 723 

/1 _ 712 ^^2 0 \ 

P = 712 1 - 712 - 723 723 (109) 

\ 0 723 1 - 723 -/ 

Note that p is symmetric because of the detailed balance condition. Its eigenvalues are: 


Ao = I, Ai = 1 - 712-723 +Y7i2 + 723 “ 712723 , A 2 = I- 712 - 723 -Y7i2 + 723 “ 712723 

( 110 ) 

One can then see that unless either 712 = 0 or 723 = 0, that is, unless the transition 
matrix is block diagonal, |Ai|, IA 2 I < 1. As a result, for a repeated process, any probability 
distribution will relax to the eigenstate corresponding to the eigenvalue Aq = 1, which is 
nothing but the uniform probability distribution (1/3,1/3,1/3). 

5.2.2. Increase of the Diagonal Entropy Under Doubly Stochastic Evolution 

As shown above, any initial state evolving under a repeated doubly stochastic process 
approaches the “infinite temperature state”. This state is the one with the maximal 
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spread in the eigenstates of any final Hamiltonian. As we discussed in Sec. 3.4, a natural 


measure of the spreading of states, in the basis of a given Hamiltonian, is the diagonal 
entropy: 


Sd 


Y^Pnn Inp nri' 
n 


( 111 ) 


This entropy is maximized for the uniform occupation probability (which, as shown in 
Appendix!^ is an attractor) so one can anticipate that Sd can only increase under doubly 
stochastic evolution. 

The diagonal entropy has many interesting properties. For example, it coincides with 
the usual von Neumann entropy for stationary density matrices. In addition, the diagonal 
entropy can be viewed as the entropy of the time averaged density matrix. The diagonal 
entropy also sets a natural “distance” between the density matrix p and the infinite 
temperature density matrix. Indeed given two discrete distributions P and Q a natural 
distance between them, also known as the Kullback-Leibler (KL) divergence [198j . i 


Dkl{P\\Q) = Y,PnHPn/Qn). (112) 

n 


21 


It is straightforward to see that this distance is non-negative and that it is zero only 
when the two distributions coincide, that is, only when = Qn for all values of n. If 
we substitute Pn —)■ pnn and Qn —?■ l/P then 

DKL{Pnn\\p°")=Soc-Sd>0, (113) 

where Soo = ln(P) is the entropy of the infinite temperature state (the highest possible 
entropy). Therefore, an increase of the diagonal entropy is equivalent to decreasing the 
distance between the actual and the infinite temperature energy distributions. 

We prove next that doubly stochastic evolution leads to an increase of the diagonal 
entropy. First, recall that if a function is convex in a given interval then 

f{x)> fiy) + {x-y)f'{y) (114) 

for any x,y in that interval. In particular, if we chose the function f{x) = xln(x), which 
is convex for any x > 0, we obtain 

xln(x) — yln(y) > (x — ?/)[ln(y) -|- 1]. (115) 

By replacing x —)> pnn and y —)> we obtain 

p(0) - p’^}^ Inp*--^- > (p^^"> - ^ ( Inp^-^l -k 1^ (116) 

Multiplying both sides of the equation above by Pn^m, and summing over n, leads to 

Pnl In p® ) - pJjlJf, In pJ3jj^ > 0, (117) 


^^Some caution is needed here as the Kullback-Leibler divergence is not symmetric and does not satisfy the 
triangular inequalities. Therefore, it is not a distance in the metric sense. 
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where we have used that = 1 and YnPnnPn^m = Finally, summing this 

inequality over m and using that YmPn^rh ~ oae obtains 

AS, ^ S« - Sf = > 0 . ( 118 ) 

n fh 


This implies that, under any doubly stochastic evolution, the diagonal entropy can only 
increase or stay constant. Hence, the distance from the uniform or infinite temperature 
distribution monotonically decreases or stays constant. It is interesting that this state¬ 
ment is not tied in any way to quantum chaos. For example, if we take an arbitrary 
two level system in a stationary state and apply any sequence of pulses then the diago¬ 
nal entropy cannot decrease. This statement, however, does not hold if the initial state 
is nonstationary. In that case, the evolution is not doubly stochastic and the diagonal 
entropy can decrease. 

The increase of the diagonal entropy under doubly stochastic evolution should be 
contrasted with the exact conservation of von Neumann’s entropy S^-a = ~Tr [p In p] 
under any unitary evolution. These two results do not contradict each other. In fact, the 
relation between these two results can be understood by considering a unitary evolution 
of an initially stationary density matrix. Then the following chain of relations hold: 

5d(0) = 5vn(0) = 5vn(i) < Sd{t) . (119) 


The first equality follows from the fact that, for a stationary density matrix, the von 
Neumann and the diagonal entropy are identical. The second equality reflects the obvious 
fact that under unitary evolution the von Neumann entropy is conserved. Finally, the last 
inequality follows from Eq. (118). This has a direct analogy in classical systems where 
Liouville’s theorem conserves the volume in phase space while the total entropy of an 
isolated system increases or stays constant. 

The fact that 


Sd{t) > Sd{0), 


( 120 ) 


means that, under unitary evolution starting from a stationary state, the diagonal entropy 
at any time t > 0 is larger than (or equal to) the initial diagonal entropy. This does not 
mean that the diagonal entropy increases continuously in time, that is, in general it is not 
true that Sd{t 2 ) > Sd{ti) for t 2 > h > 0 because at intermediate times the system might 
retain coherence. A monotonic increase occurs if we consider repeated doubly stochastic 
processes, as discussed in Sec. |5.1.T One can also prove a more general statement without 
assuming any dephasing, namely, that if one waits for a fixed long time between two pulses 
the probability that the diagonal entropy increases in time is higher (exponentially higher 
for many particles) than the probability that it decreases. The proof of this statement is 
beyond the scope of this review and can be found in Ref. |195j . 


5.2.3. The Second Law in the Kelvin Formulation for Passive Density Matrices 

As shown above, under repeated doubly stochastic evolution that starts from a station¬ 
ary density matrix, the diagonal entropy increases until it reaches its maximum value, 
corresponding to an “infinite temperature state”. 

Now, we take a step further and assume that the initial probabilities decrease mono- 
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tonically in energy, that is, for any n and m 

{p^l-p^±){En-Em)<Q^ ( 121 ) 

where En and Em are eigenenergies of the system. Relying on this assumption, one can 
prove that, for any doubly stochastic cyclic evolution (in particular, for any cyclic unitary 
process), the energy of the system can only increase or stay constant jl88l I199| : 

Y,Er,p^^l>Y,E^p^^l ( 122 ) 


By energy conservation, this difference must be equal to the average work done on the 
system during the cyclic process 

(IV) = EnP^^l > 0 . (123) 


Diagonal density matrices satisfying the inequality (121) are termed passive |188j and are 
common. The Gibbs distribution for systems in thermal equilibrium, p^n = is a 


passive density matrix. Therefore, condition (123) is quite general and can be interpreted 
as a manifestation of the second law of thermodynamics in Kelvin’s formulation - one 
cannot extract work from a closed equilibrium system by carrying out a cyclic process. 


As all the results in Sec. 5^ this statement is solely based on doubly stochastic evolution 
and on the passivity of the initial density matrix (and therefore, applies to both integrable 
and nonintegrable systems). In Sec.j^ we show explicitly how it works for a single particle 
driven in a chaotic cavity. 

The proof of Eq. ( 123| ) relies on the fact that any doubly stochastic evolution tends 
to make the occupation probabilities uniform. In the case of an initial passive density 
matrix, this process requires a transfer of probability from low- to high-energy states 
causing the energy of the system to increase. If a stronger detailed balance condition 
is satisfied, that is, pn^m = Pm^n for any m, n, then the proof becomes particularly 
simple |192| : 


(IE) — ^ ^ EnPn^m[pirim Pnn] ~ ^ ^ ^^ Pn^mjEn 


nnJ > 0. 


(124) 


The second equality follows from symmetrizing with respect to n and m and using the 
detailed balance condition. However, in general, pairwise transition probabilities are not 


the same and only the sum rule (91) is satisfied. In this case, the proof is more complicated 
but still straightforward |188] . For completeness, it is presented in Appendix [E} 


5.3. Implications of Doubly-Stochastic Evolution for Chaotic Systems 


In the previous three subsections we discussed three important results that are all man¬ 
ifestations of the second law of thermodynamics. In Sec. 5.2.1 (and Appendix B , we 
showed that the “infinite temperature state” is the only generic attractor of a doubly 
stochastic evolution. In Sec. |5.2.2t we proved that under a repeated doubly stochastic 


evolution the diagonal entropy increases until it reaches its maximum value, which corre¬ 
sponds to that of the “infinite temperature state”. Finally, in Sec. |5.2.3t we proved that 
any cyclic doubly stochastic evolution leads to an increase of the (average) energy of the 
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system (provided the initial density matrix is passive). This statement is equivalent to 
the second law of thermodynamics in the Kelvin form. 

All these statements rely only on doubly stochastic evolution and therefore apply to 
both integrable (and in particular noninteracting) and chaotic systems. In this section, 
we take a step further and assume that the system undergoing the dynamical process is 
chaotic. 


5.3.1. The Diagonal Entropy and the Fundamental Thermodynamic Relation 

The entropy of a system in thermal equilibrium is a unique function of its energy and 
other relevant extensive variables (denoted by A). This fact implies the fundamental 
thermodynamic relation 


dS = - {dE + FxdX). 


(125) 


Here, Ex is the generalized force conjugate to A. Since this expression is directly derived 
from S{E,X), it applies to both reversible and irreversible processes. In the case of a 
reversible transformation, we can identify TdS as the heat transferred and FxdX as the 
work done by the system. In contrast, for an irreversible transformation, one cannot make 


these identifications. Equation (125) is then taken as a mathematical relation between 


thermodynamic functions. For example, if an isolated gas in a container is expanded by 
a volume dX by moving a partition very quickly (a Joule experiment), there is no work 
done and the energy change in the system is zero. The fundamental relation then implies 
that the change in entropy is given by dS = FxdX/T with being the pressure before 
the expansion. If one insists on giving an interpretation to the equation as describing 
a dynamical process, it can be thought of as a hctitious reversible process (expansion 
with work and heat exchange) that is not related to the actual (irreversible) process 
that has taken place. As we show in this and the following subsection, thinking about 
the fundamental relation from a microscopic point of view is illuminating. The changes 
in the entropy can actually be assigned to underlying, in general irreversible, physical 
processes (which may result from work and/or heat exchange). The entropy change is 
then simply related to transitions between the energy levels of the system during the 
dynamical process. 

From the microscopic point of view, the fundamental relation is not at all trivial (see, 
e.g., Ref. |200 ] ). The energy and its change are uniquely defined by the density matrix 
and the energy eigenstates. The generalized force is also expressed through the density 
matrix and the Hamiltonian. Thus, for the fundamental relation to apply microscopically, 
we need to define an object, the entropy, which can also be expressed through the density 


matrix and possibly the Hamiltonian and ensure that Eq. (125) holds for any dynamical 


process both for open and isolated systems. Let us show that the diagonal entropy, 
which we defined earlier as the measure of delocalization in the energy space, satisfies 
the fundamental relation in chaotic systems |125( 1134] . As we will see, Sd satisfies 


dE = TdSd - FxdX, 


(126) 


for both reversible and irreversible processes. Once we identify Sd with the entropy, this 
constitutes the fundamental relation. 

To derive the fundamental relation, let us first use standard statistical mechanics and 
come back to the role of quantum chaos later. We assume that the initial density matrix 
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is described by a Gibbs distribution (the extension to other ensembles is straightforward) 

Pnm{X) = (127) 

Using that the energy of the system is given by E[X) = Yhn PnnEni^)■, and calculating 
its change for an arbitrary ‘small’ dynamical process (we are not assuming here that the 
system is isolated or that the process is unitary), we find 


dE{X) = di^PnnEn{X)] = 


dEn 

En{X)dp 

nn Pnn dX 


'£E„{X)dp 

nn FxdX, 


(128) 

where Ex = — pnndEn{X) / dX. Next, we compute the change in the diagonal entropy 

for the same process. This gives 


dSd = -d 


pnn HP n 


nn ln(p nn) ^ ^ dpfifi — En{X)dp 

nn; (129) 


where we used that, by conservation of probability, Yn dpnn = 0. Comparing Eqs. (128) 
and (129), and noting that the generalized force can be also written as 


Fx = - 


dE{X) 


dX 


(130) 


Sd 


we recover that the diagonal entropy indeed satisfies the fundamental thermodynamic 


relation, Eq. (126), for any dynamical process. Remarkably, under the assumption that 


the system is initially described by the Gibbs distribution, the fundamental relation 
applies exactly for both large and small systems whether they are open or closed during 
the dynamical process. Moreover, it applies to integrable and nonintegrable systems alike. 

To see where quantum chaos enters, assume that an isolated system undergoes a quench 
(or any other dynamic process) protocol. Then, according to ETH, physical observables 
after relaxation are described by an equilibrium thermal ensemble. This is true despite 
the fact that the density matrix of the entire system is not that of the Gibbs ensemble. 
With this in mind, we want to prove that if the system is chaotic then the fundamental 
relation holds up to possible subextensive corrections. The easiest way to prove it without 
assuming a standard equilibrium density matrix is to show that the diagonal entropy, 
which is a function of the density matrix, coincides with the thermodynamic entropy 
up to subextensive corrections. Then, the fundamental relation and its generalization 
immediately follows. Recall that we already presented numerical evidence that the di¬ 


agonal entropy coincides with the thermodynamic entropy in Sec. 3.4, when discussing 


implications of quantum chaos and RMT to delocalization in energy space (see Fig. 12). 


We start our discussion by noticing that, for large system sizes (no matter whether 
they are in a pure or in a mixed state), the diagonal entropy can written as an integral 
over energies 


Sd--J dEn{E)p{E) ln[p{E)], 


(131) 


where p{E) 
many-body 
function in 


is an interpolating function satisfying p{En) = Pnn and Q{E) is the smoothed 
density of states. We note that p{E)Q{E) = P{E) is the energy distribution 
the system [see Eq. (56)], from which all moments of the energy can be 
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computed. For example, the average energy is given by {E) = f dE E P{E). One can 
rewrite the diagonal entropy as 


Sd 


dEP{E) In 


P{E)SE 

n{E)5E 


= j dEP{E)Sm{E) 


dEP{E)\n[P{E)5E], (132) 


where Sm{E) = ln[U(Fl)(5Fl] is the microcanonical entropy at energy E { 5 E is the width 
of the microcanonical energy window). 

The last term in Eq. (132) is the one that exhibits a qualitatively different behavior 


in integrable and chaotic systems |125j . In nonintegrable systems, one expects P{E) to 
be a smooth function of the energy (see. Sec. 3.4). As a result, f dEP{E) \nP{E) is not 
extensive because f dEP{E) is normalized to one and the width of the energy distribution 
is not exponentially large in the system size (see Sec. |4.3.2 ). In integrable systems, on 
the other hand, P{E) after a dynamical process (such as a quench) generally exhibits 
large fluctuations (see, e.g., left panels in Fig. [IT|). As a result, the last term in Eq. (132) 


can be extensive and, therefore, comparable to the contribution of the first term, that 
is, Sd can differ from the thermodynamic entropy. Numerical studies have indeed found 
that Sd agrees (disagrees) with the thermodynamic entropy in quenches in nonintegrable 
(integrable) systems [1251113011135) (for results at integrability, see Sec. 8.3). 

Actually, if P{E) after a dynamical process is well approximated by a smooth Gaussian 
(expected for sufficiently large nonintegrable systems, see Sec. 3.4) 


PiE) 


1 


V2 


: exp 




{E-{E)f 

2ct2 


(133) 


with being, at most, extensive (for a discussion of after a quench, see Sec. 4.3.2), 


then the fact that Sd agrees with the thermodynamic entropy follows straightforwardly. 
To show that, let us expand Sm{E) around the mean energy {E) 


Sm{E) ^ Sm{{E)) + 


dSm{E) 


dE 


(E) 




2 5E2 


(E) 


(E-(E)f + ... . (134) 


By substituting Eqs. (133) and (134) into Eq. (132), and computing the Gaussian inte¬ 
grals, we obtain 


Sd ^ S^m) - - l) ’ (135) 

where Sm{{E)) = ln[II((E))\/^(T] is the von Neumann entropy of a microcanonical 
distribution with mean energy (E) and energy width SE = In the expression 

above 


= - 


dp{E) 


dE 


{E) 


with I3{E) = 


dS{E) 

dE 


(136) 


We note that here the inverse temperature (3{E) is defined solely by the density of states 
at energy E, and that is the variance of the energy in a canonical ensemble with inverse 
temperature /3((E))J^ Since is extensive, the last term in Eq. (135) is clearly non- 
extensive and can be ignored in large systems. We then see that, in chaotic systems, one 


assume that we are not at a phase transition, at which might diverge. 


61 





































August 2, 2016 


Advances in Physics Review 


can define a functional of the density matrix (the diagonal entropy) that coincides with 
the thermodynamic entropy both in open and closed systems after they are driven from 
equilibrium and allowed to relax. For closed systems, this is a nontrivial statement that 
relies on the assumption that the final Hamiltonian (the one after the dynamical process) 
is quantum chaotic (nonintegrable). From this result, the fundamental thermodynamic 
relation for chaotic systems follows without the assumption that the system is in thermal 
equilibrium. 

To conclude this section, let us mention an apparent paradox that is frequently raised 
to argue that there is a deficiency in the diagonal entropy (or, for that matter, von Neu¬ 
mann’s entropy) for quantum systems. A similar “paradox” can be argued to occur for 
Liouville’s entropy for classical systems. If one starts a cyclic process from an eigenstate 
of an ergodic Hamiltonian, where von Neumann’s entropy is zero by definition, after 
reaching the new equilibrium the energy change can be made arbitrarily small while 
the entropy change cannot. The latter will be the thermodynamic entropy. Hence, the 
equality dE = TdS seems to be violated. There is, in fact, no paradox. The entropy of 
a single eigenstate is a singular quantity. Any arbitrarily small perturbation will imme¬ 
diately lead to mixing exponentially many eigenstates of the Hamiltonian and lead to 
the thermodynamic entropy (see, e.g., the discussion in Ref. |109j l. In particular, any 
attempt to measure the temperature of the eigenstate, which is necessary to test the 
fundamental relation, will introduce an extensive thermodynamic entropy and thus the 
paradox is immediately removed. 

5.3.2. The Fundamental Relation vs the First Law of Thermodynamics 

It is interesting to put the results presented in the previous subsection in the context of 
the first law of thermodynamics: 


dE = dQ + dW. 


(137) 


This law is a statement about energy conservation and implies that the energy of the 
system can change only due to heat (defined as the energy flow from one system to another 
at fixed macroscopic couplings) and work (defined as the energy change in the system 
due to a dynamical change of these couplings). Note that, as previously stressed, the 


fundamental relation (125) is a mathematical expression relating equilibrium quantities. 


while the first law only deals with the conservation of energy. 

From the microscopic stand point it is convenient to split an infinitesimal energy change 


into two contributions [see Eq. (128)]: 


dE = dQ + dW^. 


(138) 


The first one, dQ = Ylm^ndpnn-, results from changes in occupation numbers of the 
microscopic energy levels (and is not to be confused with the common definition of heat 
in the first law) and the second one, dVFad = results from the changes 

of the energy spectrum at fixed occupation numbers. This last term, as we discussed, 
can be written as the full derivative of the energy (assuming that the energy spectrum 
is differentiable) and thus represents the adiabatic work done on the system, dlTad = 
—E\dX [see Eq. (130)]. If the dynamical process is infinitesimally slow, changing the 
macroscopic parameter of the system does not change occupation probabilities. Formally, 
one can prove this statement using adiabatic perturbation theory, similarly to what was 
done for energy in Ref. [201) . We thus see that for infinitesimally slow processes dQ = 
TdS = dQ and dW = dW^d, which is well known from thermodynamics. We note that in 
large systems the strict quantum-mechanical adiabatic limit requires exponentially slow 
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processes in order to suppress transitions between many-body eigenstates. This is another 
way of saying that isolated eigenstates are very fragile. Thermodynamic adiabaticity on 
the other hand requires that the dynamical process is slow with respect to physical time 
scales, which are much shorter than the inverse level spacing. This is of course consistent 


with Eq. (138) as transitions between nearest eigenstates lead to exponentially small 


heating and essentially do not contribute to dQ and hence to dE. So the only way to 
have significant heating is to introduce transitions across exponentially many energy 
levels, which requires much faster dynamics. 

The situation becomes somewhat different in a setup where the process is not infinites¬ 
imally slow, even if it is still effectively quasi-static. For example, one can imagine a 


compression and expansion of a piston containing a gas (see Fig. 20) at some finite rate. 


At the end of the process, when the piston is back at its original position, the energy 
of the gas is higher than its initial energy. This is essentially the way microwave ovens 
function. There, the food heats up because of the non-adiabatic work performed by the 
time-dependent electromagnetic field. This process can still be quasi-static because if in 
each cycle the energy of the system increases by a small amount then it can be approx¬ 
imately described by local equilibrium. This “microwave heating” in ergodic systems is 
indistinguishable, at the end of the process, from conventional heating resulting from 
connecting the system to a thermal reservoir. 

Therefore, from a microscopic standpoint, it is more natural to define dQ (and not dQ) 
as heat. In the literature, dQ has been called heat |192j . excess heat |202) . excess energy 
|20^ . non-adiabatic work [M], and others. We will not argue one way or another in 
terming dQ. We only note that, physically, it represents the heating of the system, that 
is, the energy change in the system caused by transitions between levels, independent of 
whether those transitions are induced by contact with another system or by changing 


non-adiabatically some coupling A, or both. As we proved earlier [see Eq. (126)], for small 
changes in ergodic systems, one always has dQ = TdSd, so this energy change is uniquely 
associated with the entropy change. If an isolated system starts in the stationary state 
then, as we proved, the entropy change is always non-negative and thus one always has 
dQ > 0. This, in turn, implies that dW > dlTad (in the latter dQ = 0), in agreement 
with the results in Sec. 5.2.3P^ 


If the system is not closed, then, by definition. 


dQ = dQ + {dW - dWad). 


(139) 


As a result, dQ > dQ so that TdSd > dQ, as expected from thermodynamics. 


6. Quantum Chaos, Fluctuation Theorems, and Linear Response Relations 

The recently discovered fluctuation theorems are remarkable equalities involving thermo¬ 
dynamic variables. They are valid for systems initially prepared in equilibrium and then 
driven far from equilibrium in an arbitrary way (see, e.g.. Refs. [SSIIM] for reviews). 
These theorems effectively replace many thermodynamic inequalities by equalities (e.g., 
the second law of thermodynamics in the Kelvin form discussed previously). In many 
cases, the proof of the fluctuation theorems, as previously done for the fundamental rela¬ 
tions, assumes that the initial state is described by a Gibbs distribution. When this is the 
case, one does not need any additional assumptions, such as quantum chaos. However, 
if the system is not weakly coupled to an equilibrium bath, then the assumption of a 


^^For processes that are not infinitesimally slow, in the presence of unavoidable level crossings, there can be 
exceptions where dW < [204] 
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Gibbs distribution is often not justified and one has to rely on quantum chaos and ETH 
to prove these relations. 

In this section, we derive the fluctuation theorems for individual eigenstates and hence 
extend them to arbitrary stationary distributions that are narrow in energy. Based on 
these fluctuation theorems, we derive energy drift-diffusion relations for both isolated 
and open systems, and discuss how they lead to nontrivial asymptotic energy distribu¬ 
tions for driven isolated systems. For clarity, we derive these fluctuation relations in two 
ways. First we show a standard derivation for an initial Gibbs ensemble, and then, for 
quantum chaotic systems, we generalize this derivation to systems prepared in individual 
eigenstates. The latter approach clarihes in which situations fluctuation theorems apply 
to isolated systems. It also allows us to derive finite-size corrections and to extend them 
to open systems that are strongly coupled to a bath. 


6.1. Fluctuation Theorems 

Particularly simple proofs of fluctuation theorems are obtained by considering isolated 
quantum systems initially prepared in contact with a thermal bath at temperature T. 
The bath is then disconnected from the system which undergoes a unitary (or, more 
generally, doubly stochastic) evolution in response to an external protocol that changes 
some macroscopic parameter in time. The protocol has a specified duration after which 
the parameter is kept constant and the system is allowed to relax back to equilibrium. 

Thermodynamics tell us that the average external work, W, done on the system during 
a thermodynamic protocol is bounded from below by the difference in the equilibrium 
free-energies (at the same temperature T) evaluated at the initial (A) and final (B) value 
of the control parameters. Specihcally, 


{W) >AF = Fb,t - Fa;_ 


(140) 


Because the system is isolated, there is no heat flowing to the system, and according 


to the first law of thermodynamics {W) = IPad + Q, where Q [introduced in Eq. (138) 
in the previous section] is the irreversible work or, microscopically, the energy change 


associated with the transitions between different energy levels. Then Eq. (140) becomes 


Q > 0. 


(141) 


For a cyclic process, W^d = 0. Therefore, {W) = Q, and this inequality reduces to 
Kelvin’s formulation of the second law. For an adiabatic process, <5 = 0, and the in¬ 


equality (141) becomes an equality. 


By properly taking into account the fluctuations, the Jarzynski equality turns the in¬ 


equality (140) into an equality even if the protocol drives the system far from equilibrium. 


This equality reads: 


l^^-WlksT^ ^ ^-AF/ksT^ 

where the angular brackets denote the average over many experimental realizations of 
the same protocol. In particular, for cyclic processes (for which A = B), the Jarzynski 
equality reduces to 


(e 


-WlksTx 


= T 


(143) 


which was hrst discovered by Bochkov and Kuzovlev |205[ 1206] as a nonlinear general¬ 
ization of the fluctuation-dissipation theorem. Equation (143) is frequently referred to as 
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the Bochkov-Kuzovlev work-fluctuation theorem 


To clarify the meaning of Eq. (142), it is better to refer to a concrete example, say, the 


compression of a gas by a moving piston (see Fig. 20). Initially, the gas is assumed to be 


in thermal equilibrium connected to a bath, with the piston at position z(0) = A. We 
assume that, during the protocol z(t), the system is not connected the bath anymore, i.e., 
it can be regarded as isolated. At the end of the protocol, the control parameter reaches 
the value z{t) = B. We record the external work TV, which is formally defined as the 
energy change of the piston. Note that the free energy is not the free energy of the 
system after the protocol. It is rather the equilibrium free energy evaluated at the initial 
temperature and the final value of the control parameter!^ Upon repeating the protocol 
many times, the work will fluctuate around some average value. The Jarzynski equality 


(142) states that the exponential of the work done, averaged over many realizations of 


the same experiment, is equal to the exponential of the equilibrium free energy difference. 
Hence, the Jarzynski equality connects a dynamical quantity, work, which depends on 
the details of the protocol, and an equilibrium quantity, the free energy difference, which 
only depends on the initial and final values of the control parameter. In particular, this 
relation can be used to measure free energy differences in small systems by measuring 
the work. In large systems, the Jarzynski relation is generally not very useful unless W 
is small. This because the average of the function exp[—/3iy] will be dominated by rare 
events in which the work is negative. The assumption that the system is not connected 
to the bath during the protocol, which was present in the original work of Jarzynski |2fl] 
and which caused some confusion, is not necessary (see Ref. |207) and the proof below). 

Equation (142) can be understood as a constraint on the work distribution P iW). This 
constraint is independent of the details of the protocol z{t), and depends only on the 
initial and final values of the macroscopic parameter, A and R, respectively, through the 
free energy difference AT = Fb^t — Fa,t- Note that the full distribution P{W) depends 


on the details of z{t). If we take the logarithm of Eq. (142), and perform the cumulant 
expansion, we obtain the expression: 


E 

n>l 


(_l)n 


n\ 


{ksTy 


AF 


(144) 


We therefore see that the Jarzynski equality constraints different cumulants of the work. 
If the work is small, or if the temperature is high, then only a few cumulants effectively 
enter the sum. When this happens, the constraint has an important consequence and 
leads to standard thermodynamic relations. However, if many cumulants contribute to 
the expansion (as expected when W is large), then the constraint does not place strong 
restrictions to the moments of the work. 


Note that, by combining the Jarzynski equality (142) and Jensen’s inequality (exp[x]) > 
exp[(x)], we recover the Clausius inequality (140). Let us emphasize that this inequality 
only applies to the average work. The work carried out in a single realization of the 
experiment can be smaller than AF. This can be the case especially in small systems, 
where fluctuations are large. The Jarzynski equality allows one to estimate the likelihood 
of such rare events [207) . 

Closely related to the Jarzynski equality is the Crooks theorem, which was originally 
formulated for classical systems [186) and then extended to quantum systems [231 [24] . 
The Crooks theorem relates the probability of performing a work W during the forward 
process, Pf{W) {A —?■ B), with the probability of performing a work —W during the 


would be the free energy of the system if it is reconnected to the same bath and allowed to re-equilibrate. Since 
no work is done on the system during its re-equilibration, such a process does not affect the Jarzynski equality. 
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reverse process, Pr{—W) {B —?• A, in a time-reversed manner): 


Pf{W) 
Pr i-W) 


^ ^(W-AF)lkBT 


( 145 ) 


The expression above can be interpreted as a symmetry of the distribution function 
between the forward and the reverse process. Note that this symmetry is with respect to 
W = 0 and not with respect to the average work. Rewriting the Crooks relation as 


Pf (W) = Pji {-W) 

and integrating over W, one recovers the Jarzynski equality. 

6.1.1. Fluctuation Theorems for Systems Starting from a Gibbs State 


(146) 


Here we derive the Jarzynski equality (142) and the Crooks theorem (145) for a system 


that is initially in a Gibbs state and is not coupled to a bath during its evolution. In this 
case, neither quantum chaos nor the limit of a large system size need to be invoked. The 
derivation relies exclusively on the symmetry of the doubly stochastic evolution between 


the forward and the reverse process [see Eq. (104)]: 


Prh^n Pn^rh' 


(147) 


We recall, that a doubly stochastic evolution describes unitary dynamics of systems 
starting from stationary states and extends to some non-unitary processes involving 
projective measurements or dephasing. 

Let us consider a system prepared in a state characterized by an initial energy E^, 
corresponding to an initial value of the control parameter A(0) = Xa, drawn from the 
Gibbs ensemble of the initial Hamiltonian. Then the system undergoes an arbitrary 
dynamical process described by a doubly stochastic evolution. At the end of the process, 
the system has an energy Eb, which is a random variable. The fluctuating worl^^ is 
formally defined as IT = Eb — Ea 
distribution: 


This work is characterized by the probability 


Pf{W) = pW Pn^rn5{EF, - E^ 

n,m 


n,fh 


e 


-0E, 


Za 


Pn^fh 


5{Efn - En 


W), (148) 


where n and (m and E^) refer to states and the spectrum of the initial (final) Hamil¬ 
tonian, and Za is the partition function associated with the initial value of the control 
parameter Ay 4 . The probability of performing work —W during the reverse process, start¬ 
ing from a Gibbs distribution, is 


Pni-W) = Y 


Zb 


Y^^,,5iEn-Ern + W) 


„-/3(E„+VF) 

6{EF^ -En-W) = Pf{W) e-^^ -Y, (149) 


n,m 


Zb 


Zb' 


^^There is an active discussion on how to define work in quantum systems, with many conflicting definitions. We 
use the definition due to Kurchan m and Tasaki m, which has a transparent physical meaning, namely, the 
energy change in the system. By energy conservation, W is also the energy change of the macroscopic degree of 
freedom associated with the control parameter. 
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where is the partition function associated with the thermal equilibrium distribution 
at the final value of the control parameter, that is, Xb- Writing the free energy difference 
as AF = Fb{T) — Fa{T), and using the relation between free energy and partition 
function, that is, Za,b = , one sees that Eq. (149) is nothing but the Crooks 


theorem (145). 


For cyclic symmetric protocols, for which the reverse process is identical to the forward 
process, the Crooks theorem simplifies to 


P{W)e-f^^ = P{-W), 


(150) 


where we suppressed the indexes F and R since, in this case, they are redundant. In 
turn, this relation can be recast in the form of a symmetry relation for the cumulant 
generating function G{Q): 


G(C) = In 


dWP(W)e-‘^^ 




(-cr 


n=l 


n! 


To see this we multiply both sides of Eq. (150) by 

P(W)e-^^e^^ = P{-W)e^^, 

and integrate over W to obtain 


(151) 


(152) 


G(/3-C) = G(C). (153) 

The generating function formalism is a convenient tool for deriving various linear re¬ 
sponse relations, for example, Onsager relations and their nonlinear generalizations (see 
Ref. |2U8j and the discussion below). 

6.1.2. Fluctuation Theorems for Quantum Chaotic Systems 

Now, let us focus on eigenstates of many-body chaotic Hamiltonians and derive the 
corresponding fluctuation relations for isolated systems. The applicability of fluctuation 
relations to individual eigenstates allows one to extend them to arbitrary initial stationary 
distributions so long as they are sufficiently narrow. The approach based on individual 
eigenstates also allows us to derive the leading hnite-size corrections to the cumulant 
expansion of these relations and prove these relations for open systems, even if they are 
strongly coupled to the bath throughout the dynamical process. 

Let us analyze the probability of doing work W during the forward process starting 
from a given many-body energy eigenstate |n). By definition, this is given by 

PpiPn ^En + W) = PpiEn, LE) = S{Ern - E^ - W) 

fh 

= J dEnBiE)p{En^ E)6{E-En-W), (154) 


where is the density of states at the hnal value of the control parameter and we 
used the fact that, for chaotic systems, the probability Pn^m ~ p( En —> E) is a smooth 
function of the energy Em, up to a small Gaussian noise (c.f.. Sec. 3.4). In non-chaotic 
systems, the transition probability Pn^m can fluctuate strongly between states that are 
close in energy, that is, changing the summation over fh by an integration over E is not 
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1 > ^ 


\n) - ^ 

h) 




-► A 

Figure 22. Illustration of relation i |i^ . A system is composed of many closely spaced energy levels. In response 
to some dynamic process, during the forward process (red arrow) the system undergoes a transition from an initial 
state |n) to a final state |m) with the energy = En-\-'W. During the reverse process (green arrow), the opposite 
transition happens. The ratio of probabilities of doing work W for the red process and —W for the green process 
is given by the ratio of the density of states of the corresponding final states [see Eq. 


justified and, in general, is not valid. Integrating the expression above over the energy, 
we find 


PpiEn ^En + W)= p{En ^ En + W) QsiEn + W). 


(155) 


Using similar considerations, we find that the transition probability for doing work 
— W during the reverse process starting from state \rh) is: 

PRiErn ^Ern-W) = Pr{E^, -W) = 5{En -E^ + W) 

n 

= ^ Pn^rhKEn - E^ + W) = f dEQ.A{E)p{E Em) 5{E - Em + W) 

n 

= p{Em -W^ Em)nA(Em - lU). (156) 

Comparing the expressions for the forward and backward processes, and substituting 


Er, 


E and 


—^ E + W, we obtain 

Ef(E, W) ^ ^b{E + W) ^ Sb{e+w)-Sa{e) 
Pr{E + W,-W) Oa(E) 


(157) 


which is known as the Evans-Searles fluctuation relation |209j (for classical derivations of 
a similar nature, see Ref. |210j l. This result tells us that the ratio of probabilities of doing 
work W in the forward process and — VU in the reverse process is simply equal to the 
ratio of the final and initial densities of states, that is, the ratio of the number of available 
microstates. Typically, as schematically illustrated in Fig. ^P2\ the density of states is an 
exponentially increasing function of the energy (corresponding to positive temperature 
states). Hence, the number of available microstates is larger for processes corresponding 
to an energy increase. This asymmetry is the microscopic origin of the higher probability 
of doing positive work, despite the equivalence of the forward and backward microscopic 
rates. Note that these considerations are only valid for chaotic systems. For integrable 
systems, not all microstates might be accessible and one needs to refine the argument. 


Relation (157) is very general. In particular, it contains fluctuation theorems and, in 
a sense, generalizes them. To see this, consider a total system with N particles and 
let us assume that a dynamical process is applied only to a small subsystem with Ni 
particles, such that Ni is kept fixed as N ^ oo {Ni can be arbitrarily small, e.g., just 
one particle, or can be macroscopic). If the subsystem is weakly coupled to the rest of 
the system, it is expected that the rest of the system will act as a thermal bath and, as 
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system I system II 



Figure 23. Schematic representation of the setup considered to prove the detailed balance condition, Eq. | |166[ l. 
A system I, which can be either quantum chaotic or not is coupled to another system II (bath), which is quantum 
chaotic. The two systems are weakly coupled and are allowed to exchange energy. The microscopic transition 
probabilities from a pair of states \ni,nii) to another pair of states (red arrows) is the same as the 

probability of the reverse process (green arrows). However, if one is only interested in transition probabilities in 
system I, irrespective of the outcome in the bath, this symmetry is broken and one obtains the standard equilibrium 
detailed balance. 


discussed previously, from ETH we expect that the small subsystem is described by the 
Gibbs ensemble. In this case, we are back to the standard setup considered in Sec. |6.1.1 


On the contrary, when the subsystem is strongly coupled to the rest of the system the 
assumption about the Gibbs distribution is not justified. However, even in this case, 
the Grooks theorem (145) (and, hence, the Jarzynski equality) still applies. The proof is 
straightforward. Since W is at most proportional to Ni and it is therefore non-extensive 
in N, we can expand the entropy in Eq. (157) to the leading order in W: 


Sb{E+W)-Sa{E) = -^W+Sb{E)-Sa{E)+0{Ni/N) = /3W-pAE+0{Ni/N), 

oh/ 

(158) 


where we used the standard thermodynamic result that at constant temperature 


- /3AF = S{E, Xb) - 5(E, Aa) = Sb{E) - Sa{E). (159) 


Therefore, up to Ni/N corrections, Eq. (157) implies that 


PFiE,W) 
Pr{E + W, -IT) 


^ ^/3{W-AF) 


(160) 


irrespective of whether the subsystem is described by the Gibbs ensemble or not. Note 
that the free energy difference here is that of the whole system. It becomes the free 
energy difference in the subsystem only for a weak coupling between the subsystem and 
the rest of the system. 

If the work IT is small but extensive, for example, coming from a global protocol, or 
if one does not take the thermodynamic limit, one can still perform a Taylor expansion 
of the entropy in Eq. (157) in powers of IT. Then, corrections to the Grooks relation 
will be generally finite. In particular, in Sec. 6.3, we will discuss how such corrections 


affect the Einstein drift-diffusion relation for the energy current (as occurs, for example, 
in microwave heating) of a driven isolated system, and to the Onsager relations. In what 
follows, we discuss various implications of Eq. (157) to setups involving both isolated 
and open systems. 
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6.2. Detailed Balance for Open Systems 


Let us use the general relation (157) to derive the familiar detailed balance condition 
for open systems. Imagine we have two weakly coupled systems I and II. We do not 
need to make any assumption about system I. It can be arbitrarily small, for example, 
consisting of one spin, and can be either integrable or ergodic. We assume that system 
II is quantum chaotic. For convenience, we call system II a bath, but we emphasize 
that we do not assume that system II is much bigger than system I. The weak coupling 
assumption is made explicit by writing the Hamiltonian of the entire system as 


Htot = Hi + Hu + ^Hi-ii, 


(161) 


with 7 small. Initially, the two systems are prepared in a stationary state with respect to 
the uncoupled Hamiltonian Hi + Hu and then they are allowed to interact. The initial 
state can be the product of two Gibbs states with different temperatures or can be the 
product of two eigenstates. Because of the coupling, energy is allowed to flow between the 
two systems. If the coupling is weak, then the sum of the energies of systems I and H is 
(approximately) conserved. This implies that only microscopic transitions between states 
|ni,nii) and |mi,mii) satisfying the energy conservation = A 


■'mi 


+ are 


allowed (see the red arrows in Fig. 23). This setup is formally equivalent to a quench in 


the coupling 7 . Therefore, the transition probabilities are doubly stochastic. Moreover, as 
we discussed in the previous section, within the Fermi golden rule (or because this quench 
can be viewed as a time-symmetric protocol) these probabilities satisfy the stronger global 
detailed balance condition 




(162) 


Suppose that we are interested in the transition between microstates only in system I, 
irrespective of the outcomes in the bath H. Hence, we have to sum over all final states 
of the bath. Using that in quantum chaotic systems the transition probabilities are the 
same for nearby eigenstates 

Pl{ni —>■ mi) = ^ ] Pni,nu^mi,mu = Pni,nii—>-mi,miif^Il(L/„^jj). (163) 

mil 


This equation is a direct analogue of Eq. ( 155| ), with the solely difference being that 
only the density of states of the bath enters the RHS, as the sum is carried out over the 
final states of the bath. For the reverse process (green arrows in Fig. 23), using the same 
arguments, we find 


Pl(mi —>■ ni) — Pmi,mu—>-ni,nii^n(E^^^) 


(164) 


Comparing these two results, using the conservation of energy together with the global 
detailed balance condition, and simplifying notations E^ —?■ E^^ (162), we find 


Pl(ni —)■ mi) _ -|- Ej^^ — E^J _ 

pi(mi ni) IIii(Eii) 


(165) 


If the bath is large, or if the energy change = Ej^^ — E^^^ is small compared to E^^, 

as before, one can expand the entropy difference in the energy change. By doing that. 
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one recovers the detailed balance condition in its most familiar form m- 

Pijni —)■ mi) ^ 
pi{mi m) 

where l3ii{E^^) is the temperature of the bath corresponding to the energy 


( 166 ) 


6.3. Einstein’s Energy Drift-Diffusion Relations for Isolated and Open 
Systems 

In Sec. |6.1[ we showed that the Jarzynski equality can be viewed as a constraint on 


the cumulant expansion of the work generating function [see Eq. (144)]. Consider, for 
example, the Jarzynski equality for a cyclic process applied to a small subsystem of a 
large system or to a system initialized in a Gibbs state. In these cases, the Jarzynski 
equality is exact and, given the fact that the process is cyclic, AF = 0, so that 


= 1 . 


(167) 


Next, let us perform a cumulant expansion of this equality: 

0 = ln(e-^^) = -P{W) + lp^{W^)e + l:P^{W^), + ... . (168) 

2 6 

where {W‘^)c = {W"^) - {W)"^ and {W^)c = {W^) - 2,{W^){W) + 2{W)^. If the average 
work performed is small and its distribution is close to Gaussian or if temperature is 
high, then cumulants of order three and higher in the expansion above can be neglected 
leading to: 


0« -/3(IV) + J/3^(IE^) 


{W) = ^{W^)c 


(169) 


The relation above can be interpreted as a fluctuation-dissipation relation for a system 
coupled to an external noise. {W) is the average work characterizing energy dissipation in 
the system and (IT^)c characterizes the uncertainty in the work. Equation (169) can also 


be interpreted as an analogue of Einstein’s drift-diffusion relation for the energy. Indeed, 
{W) is the average work done on the system during the cyclic process, which is the energy 
drift, and (lE^)c represents the work fluctuations, which is the energy diffusion. 

Next we consider a system prepared in a single eigenstate (hence, the results equally 
apply to a setup where we start from an arbitrary narrow stationary distribution). We 


focus on the general expression for the work probability distribution, Eq. (157). For 
simplicity, we focus once again on cyclicp^ and symmetric protocols Then 


P{E, W) q-S{e+W)+S{e) ^ ^ 


(170) 


In order to proceed further, let us assume that the work W is small (though it can be 
extensive). Then, the probability P{E, W) is a slow function of the first argument and a 


the protocol is not cyclic the total work W is the sum of the adiabatic work Wad', which is not fluctuating, 
and the non-adiabatic work Q which is fluctuating. Then, the following derivation remains valid if one identifies 
W with Q. 

^^If the protocol is not symmetric one has to distinguish between forward and reverse processes. 
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fast function of the second argument W. Similarly, the entropy is a slow function of W. 
Formally, the RHS of the equation above can be rewritten as 

P{E + W,-W) =e^^^P{E,-W), (171) 

where we used the notation Oe = djdE. Finally, expanding the entropy S{E + W) in 
Eq. (170) to second order in W, and integrating over W, we find 


(e P'''' 2 bb)^ 


(172) 


where ■ .)e means that an average is taken with respect to the probability distribution 
P{E, W) at fixed E. Taking the logarithm of both sides of this relation, and performing 
the cumulant expansion to the second order, one finds 

- P{W)e + \i3\W^)e,c - ldEl3 {W^)e ~ -dE{W)E + ^ - (^^(kF)^)'] ■ 

(173) 

Note that {W‘^)e = {W'^)e,c + where the first term on the RHS is linear and 

the second is quadratic in cumulants. By equating all linear terms in the cumulantf^ 
Eq. (173) reduces to: 

— (3{W)e + 20^{W^)e,c — {W‘^)e,c ~ —dE{W)E + ijdltE^W'^) e,c ■ (174) 

Using that 

BeP {W^)e,c = dE{/3{W^)E,c) - l3dE{W^)E,c , (175) 

and regrouping all the terms, one finds that the equation above simplifies to 

-/3 (^{W)e - ^{W^)e,c - ldE{W^)E,c^ +dE [{W)e - ^{W^)e,c - ldE{W^)E,c^ = 0 . 

(176) 

Therefore, the first and the second cumulant have to satisfy the relation: 


{W)e = ^{W^)E,c+ldE{W^)E,c- 


(177) 


Equation (177) extends Einstein’s drift-diffusion relation (169) connecting work and 
work fluctuations for a system prepared in a single eigenstate (and hence to a micro- 
canonical shell). In large systems, the last term in (177) is a subleading correction since 
the energy is extensive. So, in the thermodynamic limit, this term can be dropped and 
one is back to Eq. (169). However, if one is dealing with a small chaotic system, then the 
last term cannot be neglected. As we will show in the next section, it has important im¬ 
plications for determining the correct asymptotic distribution of driven chaotic systems. 
Relation (177) was first obtained by C. Jarzynski for a single classical particle moving in 
a shaken chaotic cavity |212[ 1213] , and then extended to arbitrary quantum or classical 
systems along the lines of our derivation in Ref. |193j . 


^®Formally, this can be justified by introducing some parameter e such that all cumulants are linear in e. This 
parameter can be, for example, duration of the pulse e = dt or the size of the subsystem that is coupled to the 
driving term. Since the relation l |170[ ) is valid in all orders in e, it is sufficient to verify it only to linear order. 
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The exact same analysis can be carried out if, instead of a single driven system, one 
considers two weakly coupled systems I and II as illustrated in Fig. ^ For the purposes of 
this discussion, we will assume that both systems are ergodic. Let us assume that the two 
systems are initialized in eigenstates with energies Ei and Eu and then coupled weakly. 
This coupling leads to an energy exchange between the two systems. Since the dynamics 
is unitary, and after the assumption that each system is quantum chaotic, we can apply 
Eq. (170) to this setup. The only new ingredient is that the entropy S is replaced by 
the sum of entropies of the systems I and II (corresponding to the factorization of the 
densities of states of the uncoupled systems). Thus 


P{Ei ^ El+ W, Eu Eu -W) = P{Ei, Eu, W) 

^ ^Si{Ei+W)+Sii{Eu-W)-Si(Ei)-Sii(Eii)p -W). (178) 

Note that, here, we use W to denote the energy exchange between systems I and II (even 
though now it means heat). As we will see shortly, for isolated chaotic systems an external 
drive is equivalent to a coupling to an infinite temperature bath /3n- Therefore, we prefer 
to keep the same notation for both coupled and isolated driven systems. Repeating 
exactly the same steps as before, that is, expanding the two entropies and P with respect 
to W up to the second order and integrating over the distribution function P{Ei, Eu, W), 
one obtains (see Appendix : 


{W)e,,Eu = {W^)e,,Eu,c + 2 ~ ^Eu) {W^)Ei,Eu 


(179) 


where the symbol (... )ei,e„ means that an average is taken with respect to the prob¬ 
ability distribution P{Ei, Eu,W) at fixed Ei and Eu, the suffix “c” means connected, 
and Pi — Pii is the difference in temperature between the two systems. If both systems 
are large, then the last term is again subextensive and can be dropped. It is interesting 
to note that Einstein’s relation for coupled systems (179) reduces to the one for isolated 
systems (177) if the temperature of system II is infinite Pu = 0 and {W‘^)ei,Eii,c is either 


negligible or independent of Eu- Hence, from the point of view of energy flow, driving 
an isolated system by means of an external cyclic perturbation (like it happens, e.g. 

is equivalent to coupling it to an infinite temperature reservoir. 


m 


microwave ovens 


6.4. Fokker-Planck Equation for the Heating of a Driven Isolated System 


The drift-diffusion relation (177) can be used to understand energy flow (or heating) in 
a driven ergodic system not coupled to a bath. Let us imagine that the process consists 
of many pulses, well separated in time, such that the system can relax between pulses 
to the diagonal ensemble. Then, after coarse-graining, one can view this as a continuous 
(in number of pulses) process. If the mean energy deposited in each pulse is small then 
both (IT) and (IT^)c (as well as other cumulants of IT) scale linearly with the number 
of pulses and, hence, with the coarse-grained time. Next, instead of a series of discrete 
processes, one can consider a single continuous process. Then, as long as the relaxation 
time is faster than the characteristic time for the energy change in the system, (IT) and 
(kT^)c are approximately linear in time. It is well known that under such assumptions 
transport (energy transport in our case) can be described by the Eokker-Planck equation. 
The derivation of the Eokker-Planck equation is fairly standard (see, e.g.. Ref. |19r)j ) but 
we repeat it here for completeness. We start from the microscopic master equation (88). 
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For simplicity, assuming a cyclic process and dropping the tildes: 

Pn{t + 6t) = (180) 

m 


where 6t is a small interval of time. By assumption, oc 6t. Let us define the 

energy distribution function 


U{En,t) = Pn{t)Q.{En 


(181) 


Now, replacing the summation over m in the master equation by an integration over W 
(and multiplying by the density of states), recalling that the probability of performing 
work W is P{En-,W) = pn^m^{Em), where Em = En + W, and dropping the index n 


in En, we can rewrite Eq. (180) as 


n(E,t + <5t) 


dWP{E - W, W)U{E -W,t). 


(182) 


Note that P{E — W,W) is a fast function of the second argument with the width = 
{W‘^)c oc 5t, but a slow function of the total energy of the system, that is, of the first 
argument. Similarly, the probability distribution n(E, t) is expected to be a slow function 
of E on the scale of 5W. Hence, we can use the Taylor expansions: 


U{E -W,t) = U{E, W) - WdsHiE, W) + ..., 
P{E -W,W) = P{E, W) - WdEP{E, W) + ... 


By substituting these expansions in Eq. (182), and expanding to second order in the 
work W, we find: 


n(E, t + 6t)- U{E, t) = -{W)EdEU{E, t) - dE{W)Ell{E, t) + ^{W^)EdlE^{E, t) 

+dE{W^)EdEli{E,t) + ^dlE{W^)En{E,t). (183) 

Dividing the equation above by 5t, and using the notation 


Je 


{W)i 

St 


De 


{W^)e,c 

St 


(184) 


we find the Fokker-Planck equation 


= -dE[JEn{E,t)] + ^dlE[DEll{E,t)]. (185) 

To obtain this result, we notice that the cumulants of the work are linear in transition 
probabilities and, hence, they are linear in St. So, in the limit St —?■ 0, one can substitute 
{W^)e ^ {W^)e,c. 

This Fokker-Planck equation for the energy drift and diffusion is completely general. 
Effectively, it describes the evolution of the energy distribution function under many 
small uncorrelated pulses or under a continuous slow driving such that, at each moment 
of time, the system is, approximately, in a stationary state. In general, the drift Je and 
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diffusion De coefficients are independent. However, in ergodic systems, they are related 


to each other by Eq. (177), which, after dividing by 5t, reads |193| 


Je = + -j^dEDE- 


(186) 


Likewise, for open systems, Eq. (179) implies that 


Je = ~^^E + -j^dEDE 


(187) 


In the next section, we will see how powerful this result is for finding universal energy dis¬ 
tributions in driven isolated and open systems. We note that the Eokker-Planck equation 
can be derived using a different approach, as done in Refs. |215[ I216| . 

Interestingly, one can derive Eqs. (186) and (187) from a very simple argument |193j . 
Let us discuss it here only for an isolated setup, which is relevant to Eq. (186). It is 


straightforward to extend the discussion to open systems, relevant to Eq. (187). According 


to the discussion in Sec. 5.2.1 the only attractor for the probability distribution of a 


driven system is the infinite temperature distribution: p* = Ci, where Ci is a constant. 
This implies that n*(E') = C 2 exp[S'(E)], with C 2 another constant, should be stationary 
under the Eokker-Planck equation or, in other words, that 


- dE[JEJi*{E)] + hlE[DElE{E)] = 0, 


which, in turn, implies that 


- = C 3 = 0. 


(188) 


(189) 


The integration constant has to be equal to zero because one can go to (typically, 
low) energies where there are no states and, hence, Je = De = 0. Erom Eq. ( 189[ ), one 
immediately recovers the drift-diffusion relation (186). 


6.5. Fluctuation Theorems for Two (or More) Conserved Quantities 


Until now we focused on systems with only one conserved quantity, the energy. Quite 
often one deals with situations where, in addition to the energy, there are other conserved 
quantities, such as the number of particles, magnetization, momentum, charge, and vol¬ 
ume. In this section, we extend the fluctuation relations and their implications to such 
setups. To simplify our derivations, we will focus on systems with two conserved quanti¬ 


ties: energy E and the number of particles N. At the end of Sec. 6.6 we comment on how 
to extend our results to an arbitrary (non-extensive) number of conserved quantities. We 
focus on a particular setup, where two initially separated chaotic systems I and II are 


weakly coupled allowing for energy and particle exchange (see Eig. 24). The expressions 
obtained in this setup can also be used to describe a single system, say, system I, coupled 
to an external driving by setting dSu/dEu = dSu/dNu = 0. We assume that the two 
systems are connected to each other for a short period of time r and then detached again 


and allowed to equilibrate (i.e., reach a diagonal ensemble), see Eig. 24 It is intuitively 
clear that this assumption of connecting and disconnecting the systems is not needed if 
the coupling between them is weak and the two systems are in an approximate stationary 
state with respect to the uncoupled Hamiltonian at each moment of time. This intuition 
can be formalized using time-dependent perturbation theory (see Appendix [F]) . 
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Figure 24. Two systems are connected and exchange energy and particles. (Top) Initially, the two systems have 
a well-defined energy and number of particles. (Middle) Then they are connected and allowed to exchange energy 
and particles. (Bottom) Finally they are disconnected and each system is allowed to equilibrate at the new fixed 
values of energy and particle numbers. 


Repeating the same steps as in Sec. 6.3, one can straightforwardly generalize Eq. (170) 


to: 


_ P{Ei,En,Ni,Nu,W,6N) _ 

P{Ei + W, Eu -W,Ni + 6N, Nil - SN,-W, -6N) 


- „SiiEi+W,Ni+5N)+SiiiEu-W,Nu-5N)-SiiEi,Ni)-SuiEu,Nu) 


(190) 


where 


P{Ei,Eii,Ni,Nii,W,SN) 

= P{Ei El -\- TV, Ell 


Ell - W,Ni^ Ni + 6N, Nil Nil - 6N). 


If the energy and particle changes are small, one can expand the entropy and the 
probability distribution P in Taylor series. If W and 6N are small and non-extensive, 
then only the leading derivatives of the entropy need to be kept and we find a Crooks-type 
relation for two conserved quantities: 

P{Ei, Ell, Ni, Nil, W, ^ P{Ei, En, Ni, Nn,-W, -6N), (191) 


where A/3 = /3i — /3ii, and Ak = ki — kh, with 


Ki = 


dSi 

dN, 


=-PiUi, / = I,II 


(192) 


As in Sec. 6.1.1, this relation is exact if the two systems I and II are described by grand 


canonical distributions. It is also asymptotically exact to order I /N (with N of the order 
of the number of particles in each system) irrespective of the distribution, if the two 
systems are extensive while the energy and particle exchanges are not (which is, e.g., 
the typical setup if two large macroscopic systems are connected through a surface). 
For the specific case of effusion of an ideal gas between two reservoirs kept at different 


temperatures and chemical potentials, Eq. (191) was derived microscopically in Ref. |217| . 
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Following Ref. [208], we can use the fluctuation relation (191) to derive the symmetry 
property of the cumulant generating function for W and SN, see Eq. (153). Namely, 
multiplying both sides of Eq. (191) by 

F(Ei, Eli, iVi, Nu, W, = P(Ei, En, Vi, Vn, -W, 

(193) 

for arbitrary ( and t], and integrating over W and 6N, we find 

G{A(3-C,AK-rj) = G{C,v) ( 194 ) 

together with the normalization condition G(0,0) = 1. Eor the particular choices of C 
and rj the symmetry relation (194) is equivalent to two different Jarzynski-type relations: 

(exp[—A/5VE — Ak6N\) = 1, (exp[—A/3VE]) = (exp[—AK(iV]), (195) 


where the left relation holds for ( = rj = 0 and the right for (^ = 0, rj = Ak. As in Sec. 6.3 
the angular brackets imply averaging over W and 5N starting at EpEn and Ni,Nii. 


6 . 6 . Linear Response and Onsager Relations 

Continuing with the setup of the previous section, one can perform the cumulant expan¬ 


sion of Eq. (195) up to the second order, or, alternatively, perform a Taylor expansion 


of the cumulant generating function and using relation (194). One obtains: 


Ad^ Afi-^ 

-A/3{W) - Ak{6N) + -^{W^)c + —{SN^)c + AI3Ak{W5N)c = 0, (196) 


A«2 a 2 

-AP{W) + = -Ak{6N) + —{6N^) 


(197) 


Eirst, we move all the terms to the left-hand side (LHS) of Eq. (197), then by adding 
and subtracting the resulting expression with Eq. (196), we obtain the two completely 
symmetric relations: 


{W) = ^{W\ + ^{W6N),, 


{6N) = ^ {5N^)c + ^ {W6N),. (198) 


If the coupling between the systems is weak, then either from Eermi’s golden rule or using 


the same arguments as presented above Eq. (180), we conclude that {W) and {5N) (as 
well as other cumulants such as (VE^)c) are linear functions of the coupling time 5t. Then 
one can define the energy and particle currents Je = {W)/5t and Jm = as well 


as Dww = {W^)c/St, ~ and Dwn = {W6N)c/St. Equations (198) can 

then be rewritten in the matrix form |218j : 


Je \ ^ 1 Dww Dwn \ f A/3 \ 
Jn ) 2 V Dwn Dnn / V / 


(199) 


These equations are analogous to Eq. (169) and are known as the Onsager relations. 


On the LHS one has the energy and particle currents, while on the RHS one has the 
symmetric fluctuation matrix multiplied by the thermodynamic biases {Af3 and An) 
that drive the energy and particle currents. Note that we obtained these relations as a 
cumulant expansion, not as a gradient expansion. This means that they remain valid 
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for arbitrarily large A/3 and Ak as long as the distribution P{W, 6 N) is approximately 
Gaussian. For this reason, the diffusion matrix in Eq. (199) does not have to be the one in 
equilibrium, that is, does not have to correspond to the one for A/3 = Ak = 0. When the 
temperature and the chemical potential gradients are small, contributions from higher 
order cumulants are suppressed because of the higher powers of A/3 and Ak. As a result, 
the usual Onsager relations apply. 

This derivation of the Onsager relations applies to large systems. As with the Einstein 
relation (179), there is a correction that can be important for small systems. The deriva¬ 
tion of this correction is analogous to the derivation of Eq. (179). The starting point is 
now Eq. (190). One needs to expand the entropy and the probability distribution as a 
function of Ei^n ± W and Aiji ± 6 N to second order in W and 6 N, and then to carry 
out a cumulant expansion with these additional corrections. We leave the details of the 
derivation to Appendix [Gl and show only the final result, which is the natural extension 
of Eq. ( [17^ : 


{^) = ^{W^)c + ^{W5N), + \dE{W^)c + \dN{W5N),, 

{ 6 N) = ^{ 6 N^)c + ^{W 6 N), + ^dN{ 6 N^)c + ^dEiWdN),. (200) 


The terms with derivatives are clearly subextensive and not important for large systems, 
but they can play an important role in small or mesoscopic systems. It is interesting to 


note that the Onsager relations can still be written in the conventional form (199) if one 
redefines the energy and particle currents as 


Je = -{w)-:l-dE{W^)c-:]-dN{W6N) 

T Zt It 


Jn = -{6N) ——dN{6N‘^)c—-^dE{W6N)c ■ 

T It It 

( 201 ) 

Let us comment that these results immediately generalize to more (M > 3) conserved 


quantities. Eor example, in the Onsager relation (199) one will need to use M-component 


vectors for the currents and the gradients and a symmetric M x M diffusion matrix. 


Similarly, one can generalize corrections to the currents (201) writing them using an 
M-component gradient form. 


6 . 7 . Nonlinear Response Coefficients 


Along the lines of Refs. |208l I218j . one can go beyond the Onsager relation and use 
the symmetry relation of the generating function (194) to constrain nonlinear response 


coefficients. It is convenient to work using a vector notation for M-conserved quantities 
SN, where 6 Ni stands for the energy, and 6 N 2 ,... , 6 Nm for other conserved quantities. 
Similarly, let us denote by Ak the gradients of affinities (or thermodynamic forces) 
Ka = dS/dNa (such that ki = j3) and Akq, is the difference in affinities between the 


systems I and II. Ignoring subextensive corrections, the symmetry relation (194) reads 


g{Ak-0 = g{c), 


( 202 ) 


where we recall that 


exp 




= exp[G(C)]. 


(203) 
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Differentiating this equality with respect to (a and using that G(0) = 1, we find 

dG(C) 


Ja = {6N^) = - 


dCa 


( 204 ) 


C=o 


To simplify the notation, we set r = 1 so that {SNa) = Ja is the current of the a-th 
conserved quantity. The symmetry relation (202[) thus implies that 


Jf)j — 


dG{C) 


dG{AK-C) 

dG{0 

d(a 

C=0 

dCa 

(=0 


(205) 


(^=Ak 


Let us write explicitly the cumulant expansion of the generating function G{C)- 

G{C) = ~ ^ ^ CaJa + X ^ ^ Da/sCaC/S ~ ^ ^ + • • ■ j (206) 


q;/3 


af3'y 


where Dap = {5Na5Np)c is the covariance matrix (second-order joint cumulant matrix), 
is the third-order cumulant tensor, and so on. By substituting this expansion into 
Eq.lfe.wefind 


2Ta = ^ Dai3^Ki3 Mc^^Ak/sAk^ + ... (207) 

0 01 

Using this expansion and the symmetry relations of joint cumulant tensors, such as Dap = 
Dpa and = Mp^^y, one can extend the Onsager reciprocity relations to higher order 

cumulants. As in the previous section, these results extend to externally driven systems 
by substituting Ak by kj. Note that because we carry out a cumulant expansion, and not 
a gradient expansion, all cumulant tensors, such as D^^p and M^p-y, are functions of Ak, 
that is, they are evaluated away from the global equilibrium corresponding to Ak = 0. 
If one further re-expands these tensors in gradients Ak, one obtains a more standard 
gradient expansion around equilibrium [208]. 


6.8. ETH and the Fluctuation-Dissipation Relation for a Single Eigenstate 


In Sec. 4.2, we introduced the ETH ansatz for the matrix elements of physical operators in 


the eigenstates of a quantum chaotic Hamiltonian (62). This ansatz contains two smooth 
functions of the mean energy E = (E„ -|- £'m)/2, and the energy difference uj = Em — E^. 
The first function, 0{E), is nothing but the microcanonical average of the observable 
O. The second function, fo{E,uj), contains information about the off-diagonal matrix 
elements of the operator O. Let us elaborate on the second function here and show its 
relation to nonequal-time correlation functions of the observable O (see also Ref. |154j L 
In parallel, we will be able to prove the fluctuation-dissipation relation for individual 
eigenstates. 


We begin by using the ETH ansatz (62) to analyze the quantum fluctuations of an 
observable O in the eigenstate \n): 


bOl = (n|d» - (n|6|n)2 = ^ \Onm\^ = e-^^^-+^/^^\fo{En + uj/2,u)\^\R, 




m^n 


(208) 


79 





















August 2, 2016 


Advances in Physics Review 


where we used that E = {En + Em)l‘2 = En + oj/2. For concreteness, we consider the 
most general case in which the matrix elements of observables are complex. 

Because of the ETH requirement that the function fo is smooth, the fluctuations of 
\Rnm\‘^ average out in the sum and one can replace the summation over states m by 
an integration over uj: Ylim f dio^iEn E co) = f dw exp[S(£'„ + w)]. To shorten the 
notation, we will drop the subindex n in En and, unless otherwise specified, will identify 
E with the energy of the energy eigenstate |n). Then 


60l = 


/ oo 

duj e^iE+u^)-SiE+u^/2) \fo(E + oj/2,u;)f. 

-oo 

(209) 

We are interested in expectation values of few-body (usually local) operators (such as 
the magnetization in a given region of space) or sums of those operators (such as the 
total magnetization). This kind of operators generally connect states that differ by non- 
extensive energies, implying that the function fo{E -j-uj/2,co) rapidly decreases with 
the second argument w 1291 [EH (see Figs. [l6| and |17[). On the other hand, the entropy 
S{E + ca) and the function fo as a function of the hrst argument can only change if 
u! changes by an extensive amount. This means that one can expand these functions in 
Taylor series around ui = 0: 


/o(E + ^/2,u,) = foiE.o) + + .,,. 


By substituting this expansion in Eq. (209) and keeping terms up to the linear order in 
UJ, one obtains 


50i 




\foiE,uj)\^ + 


d\foiE,uj)\‘^ uj' 
dE 2 


( 210 ) 


The function fo{E,uj) therefore determines the quantum fluctuations of the operator 
O in eigenstates of the Hamiltonian and, as a result, in the associated microcanonical 
ensembles. 

In passing, we note that the result above shows that fluctuations of O within each 
eigenstate are slow functions of the energy. Combining this observation with an earlier 


discussion of fluctuations of observables in the diagonal ensemble (see Sec. 4.2.1), we can 


rewrite Eq. (72) as 


50^^501+1^^'^ 5E\ (211) 

where n is the eigenstate corresponding to the mean energy: En = {E). We see that 
fluctuations of any observable in a diagonal ensemble have essentially two independent 
contributions, the hrst coming from huctuations within each eigenstate and the second 
from the energy huctuations. For extensive observables, these two contributions are of 
the same order but, for intensive observables conhned to a hnite subsystem, the second 
contribution becomes subleading and all huctuations are essentially coming from 80^. 
This is nothing but a manifestation of equivalence of ensembles applied to the diagonal 
ensemble. 
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The previous derivation immediately extends to more general nonequal-time correla¬ 
tion functions: 


C'o(t) = (n|d(t)d(0)|n)c = (re|d(t)d(0)|n) - (n|d(t)|n)(n|d(0)|re), (212) 

where 0{t) = jg operator in the Heisenberg picture. Repeating the same 

steps as before, one finds 


Co{t) 


f 




\foiE,io)\^ + 


d\fo{E,uj)\‘^ Lo 
dE 2 


(213) 


It is convenient to define the spectral density of the operator O as the Fourier transform 
oiCo{t) 


/ oo 

dt 

-OO 


(214) 


Substituting here the expression for Co{t), one obtains 


\fo{E,co)\^ + - 


2 , d\fo{E,u})\^ e 


dE 


2-k 


SoiE,u}). 


(215) 


Noting that \fo{E,(jj)\‘^ is an even function of w and uj\fo{E,uj)\‘^ is an odd function, 
changing u —>• —u in Eq. (215), and adding and subtracting the two resulting equations, 
one finds that 


\fo{E,io)f = ^ e-f^‘^/^So{E,io) +e^‘^/^So{E,-uj) 


d\foiE,co)(^ 

dE 


dvr - 


= — [cosh(/3t<;/2)5Q(E,a;) — smh{(5uj/2 )Sq{E ,oj)\ 


[cosh(/3u;/2)5Q (E, w) — sinh(/3a;/2)5Q (E, w)] , (216) 


27ra; 


where 




/ OO 

{n\{0{t), d(0)}|n 

-OO 


— OO 
OO 


SoiE,uj) = So{E,u;) - So{E,-u;) = / dte^‘^\n\[O{t),O{0)]\n) 


(217) 


are the Fourier transforms of the symmetric and antisymmetric correlation functions, 
{•, •} stands for the anti-commutator and [•, •] stands for the commutator of two opera¬ 
tors. The symmetric correlation function appears in the quantum fluctuations of phys¬ 
ical observables and the antisymmetric correlation function appears in Kubo’s linear 
response. We thus see that the absolute value of the function fo{E,Lj) and its deriva¬ 
tive with respect to E are determined by the Fourier transforms of the symmetric and 
antisymmetric nonequal-time correlation functions of the operator O taken in the many- 
body eigenstate state |n), or, equivalently, in the microcanonical ensemble consisting of 
a single eigenstate with the energy E = E^. The phase of this function is not uniquely 
defined because, in Eq. (62), the random function Rnm is defined up to a random phase. 
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One can also invert the relations (216) and obtain 
Sq{E,uj) = Att 




21 


cosh(/3u;/2)|/o(-E, w)| + - sinh(/3a;/2) 


Sq {E, oj) = dvr 


UJ 


sinh(/3a;/2)|/o(-E,a;)| + -cosh(^a;/2) 


dE 

d\fo{E,u:)\^ 


dE 


( 218 ) 


Let us recall that S{E,u}) also determines Kubo’s linear response susceptibility (see, 
e.g., Ref. mn]): 


r*CO coo 


Xo{(^) = i / dte^‘^ ([O(t),O(0)]) = i / dte*^ [Co{t) - C'o(-t)], (219) 

Jo Jo 

where we used that, for any stationary distribution (including an energy eigenstate). 


{0{0)0{t))c = {O{—t)O{0))c = Co{—t). Using Eq. (213), and that 


= •k 5 {v) + iP 


( 220 ) 


where V{l/v) stands for the principal value, we find 


Xo{E,uj) = 2TTi 


sinh(/3a;/2)|/o(E',a;)p + cosh(/3a;/2)- 


u d\foiE,uj)\^^ 


dE 


7. 


+ V du 


2\fo{E ^.)| 2 sii^h(/lt// 2 ) d\fo{E,v)\‘^ i/cosh(/3i//2) 


V — OJ 


dE 


U — OJ 


Hence, the imaginary part of Kubo’s susceptibility is also determined by \fo{E,oj)\‘^\ 
dJlxoiE^oj)] = 27r 


|/o(E,a;)7sinh(/3a;/2) + cosh(/3cj/2) 


= -^SoiE^oj). 

( 221 ) 

If O is a local operator, or a sum of local operators, then the terms with derivatives of 


the total energy become unimportant for very large systems, and Eqs. (218) and (221) 
simplify to: 

So{E,oj) Ri 47rcosh(/3w/2)|/o(E,u;)7, X[xo{E,oj)\ Ri 27r|/o(E,u;)^ sinh(/3a;/2), 

( 222 ) 

which imply the famous fluctuation-dissipation relation |219j : 


S'^{E, w) Ri 2 coth dJ [xo{E, w)] . 


(223) 


We see that, as with the fluctuation theorems, the fluctuation-dissipation relation does 
not rely on the assumption of a Gibbs distribution. It is satisfied for every eigenstate of 
a chaotic Hamiltonian (away from the edges of the spectrum, where ETH is expected to 
hold), and hence for any stationary ensemble with non-extensive energy fluctuations. 

For finite systems, one can calculate corrections to the fluctuation-dissipation relation. 
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For example, combining the relations in Eq. (216), one finds that 


■ /3w / a;2 \ a; jiu 


S*{E,u) 


\ LO Bui ' 


So{E,u), (224) 


which replaces the standard relation 


sinh ~ cosh {E, ui), 


(225) 


valid for Gibbs ensembles or for individual eigenstates in very large systems. This more 


general fluctuation-dissipation relation for individual eigenstates (224) still connects the 
noise and the dissipative response but in a more complicated way. 


6.9. ETH and Two-Observable Correlations Functions 


In the previous section we established a relation between fo{E, ui) and the Fourier trans¬ 


form So{E,ui) of the nonequal-time correlation function of O, see Eq. (216). Here, we dis¬ 


cuss how the ETH ansatz for two observables and are related to their nonequal¬ 
time (connected) correlation function (n|0^^^(t)0^^^|n)c. Because of its experimental rel¬ 
evance, a case of particular interest is when = 0(xi) and = 0{x2)- This because 
(n|0(xi, t)0(x2, 0)|n)c determines the 0-0 structure factor. Another commonly encoun¬ 
tered situation correspond to and representing different components of some 
observable, such as the magnetization, the current, and the electric polarization. 


We rewrite the ETH ansatz for the two observables [see Eq. (62)] as 


Ogi = 0^^\E)6mn + e-^^^y^TliiiE,co), 


(226) 


where j = 1, 2, and Tmn(E,ui) = fou){E,uj) R^n ■ This allows us to write 


(n|di(t)d2(0)|n)c = I duje^‘^/‘^-^^^Ki2{E + ui/2,oo), (227) 

where the noise kernel 


Ki 2 {E -\- ca/2, uS) — ThmiE ui/2, ui)Tmh(E -|- ui/2, ui). (228) 

The overline indicates an average over states |m) within a narrow energy window, that is, 
at fixed E and uj (or, equivalently, an average over a fictitious Random Matrix ensemble). 

It is apparent in the expressions above that the noise terms R^n R^n must, in 
general, be correlated with each other, or else (n|Oi(t)O2(0)|n)c = 0. Hence, the assump¬ 
tion that they are random numbers with zero mean and unit variance is oversimplifying 
and only applicable if we are interested in (n|O(t)O(0)|n)c. In order to generalize ETH 
to deal with nonequal-time correlations of different observables, one can still take Tmn to 
be Gaussian with zero mean but its noise kernel with other observables generally needs 
to be nonvanishing. 


83 











August 2, 2016 


Advances in Physics Review 


Inverting Eq. (227), one has that 


Ki 2{E + u)/2 ,lo) = — - - Si2{E,ijj), (229) 

ZTT 

where 

/ OO 

dte*‘"*(n|Oi(t)O2(0)|n)e. (230) 

-OO 

For nonequal-space correlation functions in translationally invariant systems, when 
indexes 1 and 2 represent spatial coordinates xi and X 2 for a given observable O, one 
can write 


Ki 2{E + u}j2,uj) = K{E + a;/2, w,xi - X 2 ). 


(231) 


In this case it is convenient to define the spatial Fourier transform K and work in the 
momentum space, as one does with structure factors. If the system exhibits Lorentz or 
Galilean invariance one can further restrict the functional form of the noise kernel. 


We note that, as we did in Sec. 6.8 one can further simplify Eq. (227) splitting the 


correlation function into its symmetric and antisymmetric parts and Taylor expanding 
the noise kernel Ki^ 2 {E+ 0 :/2,u) with respect to the first argument. However, we should 
stress that the extension of ETH that we have introduced in this section still needs to be 
verified numerically. In particular, one needs to understand the regime of validity of the 
Gaussian ansatz and its applicability to the study of higher order correlation functions. 
These important questions are left open to future investigations. 


7. Application of Einstein’s Relation to Continuously Driven Systems 


In this section, we discuss several examples illustrating how one can use the Einstein 
relation to obtain nontrivial information about driven systems. We focus first on driven 
isolated systems which, in the absence of the drive, have only one conserved quantity 
(energy). We study the energy distribution obtained after a generic quasi-static pro¬ 
cess, where the driven system is approximately in equilibrium at each moment of time. 
Schematically, a quasi-static process in an isolated system can be represented as a series 


of small quenches and relaxation to the diagonal ensemble (see Fig. 25). As described 


in the previous section, the same setup applies to continuous driving protocols provided 
that the relevant relaxation time in the system is fast compared to time over which the 
energy of changes significantly]^ As we will show, this setup, besides being common, 
allows one to take full advantage of the predictive power of the fluctuation theorems to 
derive results even if the overall energy change in the process is not small, and which, 
as we will see, might lead to energy distributions that are non-thermal. This setup is 
analogous to heating in a microwave oven. In the latter, heating occurs not due to the 
coupling to an external heat reservoir (like in the conventional oven) but rather due to the 


non-adiabatic work performed by the time-dependent electromagnetic field, see Fig. 26 


Even though this field is periodic in time, the typical relaxation time in the system is 
much faster than the pulse frequency and therefore the process is quasi-static. Such a 
process is quasi-static but it is not adiabatic since each electromagnetic pulse performs 


^®In standard thermodynamics, by a quasi-static process one usually understands a process in which the irreversible 
work comes from heat exchange with a heat bath. However, such a definition is very restrictive. 
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Figure 25. Schematic representation of a continuous process as a series of quenches followed by relaxation to the 
diagonal ensemble. This approximation is justified if the relevant relaxation time in the system is short compared 
to the characteristic time scale associated with the change of A. 


irreversible work dQ [see Eq. (138)] on the system. That work accumulates and leads 
to heating. This heating can be described by the Fokker-Planck drift-diffusion equation, 
where the drift and diffusion terms are connected by Einstein’s relation (see Sec. 6.4). 




Figure 26. Schematic comparison between the usual thermal heating (traditional oven, top) and an energy increase 
due to non-adiabatic work (microwave oven, bottom). 


In what follows, we apply these relations to several setups and show how they allow 
one to make nontrivial statements about the asymptotic energy distribution after long 
times. Moreover, one can even predict the existence of dynamical phase transitions. 
All the examples analyzed in this section are classical. The reason is that microscopic 
simulations of long-time dynamics in quantum chaotic systems are very difficult. It is 
expected, however, that the general expressions and the Fokker-Planck formalism apply 
equally to classical and quantum systems. 


7.1. Heating a Particle in a Fluctuating Chaotic Cavity 

We start by considering a very simple problem, that of a classical particle bouncing 
elastically in a cavity in two spatial dimensions. When the cavity is stationary, the 
energy of the particle is conserved. If the cavity is chaotic, in the long-time limit, the 
particle reaches a uniform position distribution and an isotropic momentum distribution. 
We consider a process in which the system is repeatedly driven by deforming the cavity. 
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Figure 27. Schematic representation of a particle moving in a two-dimensional chaotic cavity with moving bound¬ 
aries. The driving protocol consists in repeatedly deforming the cavity between the two shapes shown, while keeping 
its volume fixed. 


At the end of each cycle, the cavity comes back to its original shap^^ and the system is 
allowed to relax in the sense described above (see Fig. 27). In a single collision with the 
moving wall, the particle’s kinetic energy can either increase or decrease. However, it will 
always increase on average and eventually the particles velocity will become much greater 
than the velocity of the wall, so that the work per cycle automatically becomes small. 
The assumption that the cavity is chaotic implies that there are no correlations between 
consequent collisions. If this is the case, then one can consider a continuous driving 
protocol instead of repeated quenches and all the results will be the same. Such a setup 
was analyzed by Jarzynski |2I3j followed by other works |193l I220II224] . An interesting 
and nontrivial result that emerges from this analysis is a nonequilibrium exponential 
velocity distribution (to be contrasted with the Gaussian Maxwell distribution). Here, 
we analyze this problem in two ways. First, using standard kinetic considerations and 
then using the Einstein relation. 

Let us denote the velocity of the particle as v and velocity of the wall as V. Note 
that the wall velocity is perpendicular to the boundary and, by convention, it points 
outward, that is, V = Vh, where h is the outward normal vector to the boundary. Since 
we assumed that the cavity deformations are volume preserving V averaged either over 
time, or over the boundary of the cavity, is zero. By elementary kinematics we know that, 
during a collision, the component of the relative velocity perpendicular to the boundary 
is reversed while the component parallel to the boundary is unchanged: 


= v-2 


^F-h — I/-n^n = 'U — 2 ^Fj_ 


(232) 


where, in the last equality, we have used that V = Vh and v± = v±h = {v ■ h) h. Note 
that for a collision to happen we need to have v_\_ — V >0 indicating that the particle 
and the wall are approaching each other. As a check, we verify that when the boundary 


is stationary, that is, H = 0, the update rule (232) simplifies to 


v' = V — 2Fj_ = F|| — Fj_ 


(233) 


where Fy = F — F_j_. As expected, this expression simply states that the perpendicular 
component of the velocity is reversed while the parallel component is unchanged. 


®®This requirement can be further relaxed. It is only important that the cavity comes back to the original volume, 
as the density of states only depends on the volume. 
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The energy change of the particle during a collision is 

AE='^ (|i;'|2 - |i;|2) = 2m {V^ - Vv±) . (234) 


We therefore see that the sign of the energy change depends on the sign ol V'^ — Vv±. 
Combining this result with the constraint v_\_ — V >0, which ensures that a collision takes 
place, we obtain that: i) the energy increases if the boundaries move inward, i.e., 14 < 0 
and ii) the energy decreases if the boundaries move outward, i.e., V > 0. The probability 
of a collision per unit length, L, is proportional to the relative velocity between the 
particle and the wall — V (provided this is positive). For the collisions where the 
energy of the particle increases (i.e., 14 < 0) the latter is 


c c Sf 

P> = jj^iv±-V)e (uu - V) = —(uu + |F|) 0 (uu + 14^1), (235) 

where 6 is the Heaviside step function, c is a proportionality constant (for many particles, 
c is proportional to the particle density), and 8 was introduced for convenience. On the 
other hand, for collisions in which the energy of particles decreases (i.e., 14 > 0) the 
probability of collision is 


cSf c Sf 

P< = jj^iv±-V)e (uu - V) = —(uu - |H|) 0 - |H|). (236) 


We are interested in the limit in which the wall moves slowly compared to the particle, 
|u_l| ~ |u| 3> 1141 and therefore the two-step functions above can be simplified to 


I.e., 


9{v±), indicating that u_l > 0. Rewriting expression (234) for the energy increasing 
(decreasing) collisions as 


AE> = 2m {V^ + |H||ux|) , AF;< = 2m {V^ - iHlIu^l) , 


(237) 


and using the expressions above for the collision probabilities, we find the average heating 
rate (energy drift) and the energy diffusion: 


Je = ^ [A£'>p> AF;<p<] = cmV^ |u_l| 0{v±), 

De = 1^ [{AEyfpy + (Ai?<)2p<] PS cm^V^]^e{vE), 


(238) 


where, in the second line, we have kept only the leading contribution in |14|/u_i_. The 
overline indicates averaging over the velocity distribution. It is convenient to use polar 
coordinates, where u_l = |i;|cos(0) with cf) G [—7r/2,7r/2] to ensure u_l > 0. Using that 
the particle energy is E = m|iTp/2, we obtain 


Je 


De 


cmV^ /2Uy/^ 1 

L \m ) 27r ^ 

L \m ) 2 tt 


r/2 

—k/2 
7r/2 


dcj) cos(p, 
d(j) cos 4>^. 


'-7r/2 


(239) 
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Computing the integral over the angle </>, we obtain 


Je = cVe, De = C-E^^ 

o 


( 240 ) 


where C = cV‘^ \/2m/(L7r) is a constant with dimensions of ^energy/time. Note that 
when computing the averages over v± in Eq. (239), we have used a uniform measure, 


that is, we took all angles (j) to be equally probable. This is justified because the cavity is 
assumed to be chaotic. We note that the same analysis can be carried out in an arbitrary 
spatial dimension d. The only difference arises when computing the corresponding angular 
integrals. A straightforward generalization of the analysis above leads to: 


Je = Cd'/E, De = Cd 


d+1 


-E^l\ 


(241) 


where Cd is an overall constant that depends on the dimensionality d. 


These coefficients satisfy the Einstein relation (186): 


2Je = I3{E)De + BeDe- 


(242) 


To see this, we note that the single-particle density of states in d dimensions is Q{E) oc 
j^(d- 2 )l 2 therefore 


m = dElnlQ(E)] = 


d-2 
2E ■ 


(243) 


Combining this with Eq. (241), it is easy to see that Einstein’s relation is indeed satisfied. 


In fact, there was no need to carry out these relatively elaborate calculations. It was 
sufficient to note that Je in all dimensions must be proportional to '/E, which, for 
example, follows from the fact that the average number of collisions is proportional to 
|u_l| ~ '/E. Then, the Einstein relation immediately hxes the energy diffusion De with 
respect to the energy drift Je- 


With relation (241) in hand, we can rewrite the Fokker-Planck equation describing the 


heating process in an arbitrary spatial dimension d as 


dtP(E,t) = -CddElVEP(E,t)] + —CddEElE^^^P(E,t)]. (244) 


As a preliminary step, we define a new variable f = tCd and write dt = Cd df so that 
the constant Cd disappears from the Fokker-Plank equation for P{E,t'): 

drP{E, t') = -dEWEP{E, t')] + ^aEE[E3/2p(E, t')] (245) 

We observe that, by a scaling analysis of this equation, t' ~ \/~E. Therefore, we can use 
the following scaling ansatz 


P{Ep') 


2at'y/E 


f 



1 

2day/E 


fiO- 


(246) 


where a is a constant of order one, which will be fixed later, and the prefactor l/{2at'y/E) 
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has been chosen so that the normalization condition becomes 

dE ( VE^ 


1 = / dEP{E,t') = , , 

Jo Jo 2at'VE at' 


dUiO- 


(247) 


By substituting ansatz (246) into the Fokker-Planck equation (245), we find: 


2a(l + d)f{^) + [2 — d + 2a{l + d)^] f'{i) + if"{Cj — 0. 


(248) 


It is therefore convenient to choose a ^ = 2(1 + d) so that the equation assumes a 
particularly simple form: 


/(0 + ( 2 -d + 0/'(0 + e /"(0 = o 


/(6 = 


1 


(d-l)! 


e 


d-l 


(249) 


where the normalization constant was fixed using Eq. ( 247| ). 

Defining r = at' = tCd/\2{d + 1)], we obtain the final result for the asymptotic energy 
distribution P{E,t) (see also Refs. |223L 1224] !: 


P{E t) = _ ^ ^ ^ f.-'/E/i 

^ 2(d-l)! T<^ 


(250) 


This distribution is universal in the sense that it does not depend on any details of the 
driving protocol. It is interesting to compare this result with the equilibrium canonical 
distribution at temperature /?: 


Pc{E,13) = 


\--l3P‘^E' 


r[d/2] 


2 e 


-ftE 


(251) 


Clearly, these two distributions are different but share some properties. In particular, 
they both decay in energy faster than any power law so that all energy moments are 
well dehned. However, P{E, r) decay in energy is slower than the Pc{E, (3) one, that is, 
the former is “wider”. To quantify this, we compute the first and second moments of the 
energy with respect both to P{E,t) and Pc{E,f3): 


{E)r= dEEP{E,T) = d{d+l)T^, {E^)r = d{d + l){d + 2){d + 3)r^ 

° (252) 

(E), = dEEP,{E, (3) = 

and define the relative energy width as a figure of merit to compare the width of the two 
distributions: 


^ = Ml _ 1 = 6 + 4^ M = M£ _ 1 = 2 

(E)2-(E)2 d{d + iy {E)l- {E)l d- ^ > 

Clearly, the nonequilibrium distribution P{E,t) is wider than Pc{E,l3) in any spatial 
dimension. In particular, as d increases from d = 1 to d = oo, the ratio of the relative 
energy widths changes from 2.5 to 2. In the analysis above, the dimensionality d only 
enters through the density of states. If we deal with a gas of weakly interacting particles in 
three dimensions, and the relaxation time of the gas is fast compared to the characteristic 
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rate of energy change due to the cavity’s motion, then the same analysis can be applied. 
The only difference is that d —)• 3N, where N is the number of particles. Therefore, the 
result that the asymptotic width of the energy distribution of a driven gas is twice as large 
as the width of the Gibbs distribution applies to any weakly interacting many-particle 
gas in any spatial dimension. The gas asymptotically approaches a universal distribution 
(at least has universal energy fluctuations), but it is not the canonical distribution. In 
the examples that follow, we will show that a generalization of this result applies to 
arbitrary driven interacting systems. 

Since the distribution P{E, r) is a nonequilibrium distribution, it should also have 
lower entropy than the equivalent Gibbs distribution. The two entropies are 


Sr 

Sc 



dEP{E, t) In 
dEPc{E, /3) In 


n{E) 

' PciE,py 

n{E) 


d{l + Inr) + In (2r[d]), 

G(l-l.i/S) + ln(r[d/2]). 


( 254 ) 


In order to compare these two entropies, we evaluate Sc at /3 ^ = 2{d + l)r^ (so that the 
distributions P{E, r) and Pc{E, /3) have identical average energy) and compute Sc — Sr- 


S, - Sr =- In 


2{d + 1 ) 


+ In 


(v\m 

\ 2r[d] 


(255) 


This function is positive and increases monotonically from 0.07 for d = 1 to (1 —ln2)/2 ~ 
0.15 for d —)• oo. Note that, for large d, the first term of both entropies in Eq. ( |254 ) is 
proportional to d, so the relative difference between them decreases with the dimen¬ 
sionality!^ According to our previous discussion, in a weakly interacting gas one has to 
substitute d —)• 3N so that, in the thermodynamic limit {N —>■ oo), the driven gas has 
a thermal entropy up to subextensive corrections. If, conversely, we are dealing with a 
gas of noninteracting particles (implying that the their relaxation time is longer than 
the heating time) the entropy difference between the driven and the thermal gas will be 
extensive. This happens despite the fact that the total energy distribution of the nonin¬ 
teracting gas is still a Gaussian due to the central limit theorem. This extensive entropy 
difference can be used, for example, to build more efficient heat engines and even to beat 
the fundamental Garnot bound in some cases (see the discussion in Ref. |225j l. 

Let us note that the form of the distribution (250) was obtained under the assumption 
of constant driving, i.e., V'^ = const. If one uses feedback control mechanisms such that 
velocity of the wall is tied to the velocity of the particle, i.e., V = V{E), then one 
can change the energy dependence of Je oc V'^Ve and, hence, change the resulting 
nonequilibrium distribution. One can even induce dynamical phase transitions (see the 
next example). 

In passing, we note that the same results for the energy distribution have been derived 
in a the context of the Lorentz gas |226j . This gas is defined as a system of noninteracting 
light particles colliding with an interacting gas of heavy particles moving with an average 
velocity V. If the ratio of the masses is very large, then there is no effect of the collision 
on the heavy particles so the latter serve precisely the role of moving boundaries. In 
this case, the behavior of the light particles can be obtained exactly via the Lorentz- 
Boltzmann kinetic equation. It is interesting to note that the ensemble of heavy particles 
can be viewed as an infinite temperature heat bath. Indeed, the average energy of heavy 
particles M{V‘^)/2, which defines temperatnre, diverges in the limit M ^ oo at fixed 


^^The /5 or r independent terms in both entropies play no role in thermodynamics. 
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Thus, according to the general discussion of Sec.[^ this simple example shows that 
an external quasi-static driving of an isolated system is equivalent to the coupling to the 
inhnite temperature bath. 

While the single-particle example considered here is relatively simple, it teaches us 
several important lessons that can be extended to many-particle systems. In particular, 
it shows: (i) the possibility of nonequilibrium universal distributions, and (ii) that doubly 
stochastic evolution for quasi-static driving protocols is sufficiently constraining to pre¬ 
dict such universality irrespective of the details of the driving protocol (in our example, 
the details are encoded in the overall constant Cd)- 


7.2. Driven Harmonic System and a Phase Transition in the Distribution 
Function 

Next, we consider a driven single particle in a harmonic trap [193] . This particle is weakly 
coupled to a finite system composed of N identical particles, so that the overall system is 
ergodic. The details of the larger system dehne the density of states and therefore 

I3{E). Repeated impulses of short duration act on the particle and drive it away from 
equilibrium. The time scale between impulses is taken to be larger than the equilibration 
time of the particle. In addition, we assume that the coupling of the particle to the rest 
of the system is so weak that during the impulse it can be ignored. This setup can be 
easily generalized to impulses acting on an extensive number of particles. The energy of 
the particle £ (we use this notation to distinguish it from the total energy of the system 
E) between cycles is given by 


£= V + ^ 
2 2 


(256) 


For simplicity, we work in one dimension. In Eq. (256), x is the coordinate of the particle, 


V its velocity, m = 1 its mass, and k is the spring constant. Because the system is ergodic 
and N is large, at any given energy of the system, the probability distribution for (x, v) 
before the impulse is Gibbs: p{x, v) oc exp[—/3(E)e]. We take the impulse magnitude to be 
F(x)5t, with 5t short enough so that, during the impulse, the particle’s position does not 
change appreciably and the coupling to the rest of the system can be ignored. Following 
the impulse, the momentum changes according to p ^ p + E (x) 6t. It is straightforward 
to calculate both the drift Je and the diffusion constant De m- 


Je = ([F(x)]2)^, 




2\ SE 


where the angular brackets denote an average over p(x, v) and r is the time (or average 
time) between impulses. As in the previous example, one could have calculated Je and 
deduced De from the Einstein relation. Note that here, in contrast to the single-particle 
example, the OeDe term in the relation 2Je = (3{E)De + OeDe is a 1/N correction, 
which is negligible. Technically, this correction would amount to the fact that, at finite but 
large N, the single-particle distribution p(x, v) slightly deviates from a Gibbs distribution. 

Next, we consider the probability distribution for the energy of the system after the 
application of many impulses. To proceed, we have to assume a specific form for n(E) 
[or equivalently f3{E)] and for E{x). For simplicity, we take (3 {E) oc E~°‘ and E (x) oc 
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sign(x)|x|'’ so that 


Je oc 


(257) 


where we used (x^) oc I3{E)~^ oc E°‘, with the first proportionality following from the 
equipartition theorem. For large N, due to the central limit theorem, the energy distri¬ 
bution is approximately Gaussian. Therefore, to characterize the distribution, it snfhces 
to find the mean energy as a function of time and its variance as a function of the mean 
energy. 

To find the relation between the mean energy and its variance, to leading order in 1 /V, 
we can multiply the Fokker-Planck equation (185) by E and E'^ and integrate over all 
energies. This yields the following differential equations describing the time evolution of 
{E) and cr^ = {E‘^) — {E)‘^, where angnlar brackets stand for averaging over the probability 
distribution P{E,t): 


dt{E) = {Je) 

dta^ = {De) + 2 {{Je E) - {Je) {E)) . 


Combining these equations yields 

da^ _ {De) + 2 {{Je E) - {Je) {E)) 
d{E) {Je) 


(258) 


Moreover, if the energy distribution P{E) is narrow, as is the case of large N, we can 
evaluate the averages within the saddle-point approximation. Using the Einstein relation 
Je = I^{E)De/‘ 2‘i we obtain; 


da'^ 

W) 


2 

Jm) 


9gJg((E)) 

Je{{E)) 


a\{E)). 


(259) 


Integrating this equation between the initial energy of the system {E)q and its final 
energy {E) gives 


a\{E)) = at 


2 JE{{E)f 


Je{{E), 


+ 24((E)) 


JE) 


dE' 


\e)o jE{E'Ym) 


(260) 


where ao is the initial width of the distribution and {E)o is the initial mean energy. 

We now use this equation in conjunction with the results obtained for the particle in 
a harmonic trap. In that case, the change of the mean energy of the system is given by 


dt{E) = Je{{E)) = c{E)^ . 


(261) 


Note that the values of a (recall that ar = s), which define the specific heat exponents, 
are constrained by thermodynamic reasons to 0 < a < 1. The lower bound is required 
due to the positivity of the specific heat, and the upper bound assures that the entropy 
[(S'(ii') oc is an increasing unbounded function of the energy (the latter condition 

can be violated in systems with bounded energy spectrum). To prevent the energy of 
the system from diverging at finite time, we require s < 1 [as follows from integrating 


Eq. d^ ]. 

Eor simplicity, we assume that uo —)• 0, that is, t hat we are starting from a very narrow 
microcanonical distribution. As done in Sec. 7.1, it is useful to compare the width cj^ 
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to the equilibrium canonical width a‘1 = —dj^{E) ~ /a. This comparison reveals 

that, as the functional form of the impulse (specifically, the value of r) is changed, the 
system displays a transition between two behaviors. To see this, note that a'^{{E)) is 
controlled by the exponent rj = 2Q;r — a — 1 = 2s — a — 1, which determines if the integral 
in Eq. (260) is controlled by its lower or upper bound: (i) When rj < 0, the width is 
Gibbs-like with —)■ 2a/\r]\, that is, the ratio Ict‘1 asymptotically approaches a 

constant value that can be either larger or smaller than one. Smaller widths correspond to 
protocols with large and negative s. Namely, protocols where Jg is a strongly decreasing 
function of the energy, (ii) When r/ > 0, there is a run-away regime. Here, the width 
increases with a higher power of the energy than the canonical width: jo^ ~ E^. The 
resulting distribution is significantly wider than the canonical one. Given the constraint 
on the value of s, this regime can only be reached if a < 1 (in particular, this regime 
is unreachable for a driven classical ideal gas). The transition between the two regimes 
occurs when r/ = 0. In this case, ~ 2aln((E)/(£')o). One can show that close to 

this transition, when \ri\ <C 1, there is a divergent time scale (in terms of the number of 
impulses) required to reach the asymptotic regime, see Ref. |193j for details. Therefore, 
this setup realizes a dynamical transition for the asymptotic energy distribution of the 
system, which is qualitatively similar to a continuous phase transition. The parameter r] 
plays the role of the tuning parameter. 

For concreteness, consider a system with a = 1/2 (such as a Fermi liquid or the one¬ 
dimensional harmonic system above). When r = 1, we have ij = 2ar — a — 1 = —1/2 
and the resulting distribution is Gibbs-like with = 2. When r = 3/2, we are at 

the critical regime r] = 0. Finally, for r = 2, one has = 1/2 leading to the run-away 
regime with d j ~ E^!"^. Note that Eq. (260) implies that the existence of these three 
regimes is generic. In particular, depending on the functional form of Je{E) and 13(E), 
the variance of the distribution can be larger or, surprisingly, smaller than the width of 
the equilibrium Gibbs distribution at the same mean energy. Specifically, d(E)/a'^{E) 
can be made arbitrarily small by a proper choice of Je(E). Also, the existence of the 
dynamical phase transition described for this simple model is only tied to whether the 
integral in Eq. (260) diverges or converges at high energy. The emergence of a nontrivial 
universal asymptotic behavior of the energy distribution is insensitive to the details of 
the driving protocol, such as the driving amplitude and shape of the pulse. 

While in the examples in this section we focused on classical systems, the same con¬ 
clusions apply to driven quantum systems (see Ref. |193j for specific examples). One can 
also anticipate nontrivial universal non-equilibrium distributions in driven systems with 
more than one conserved quantity. In this case, the Onsager relations will be responsi¬ 
ble for constraining the mean flows of the conserved quantities and the flows of their 
fluctuations. 


7.3. Two Equilibrating Systems 


As a final example, we follow Ref. EHj and describe the equilibration of two weakly 
coupled systems, as those shown in Fig. 24 but with no particle exchange (Jn = 0). 
When one of the systems is much bigger than the other one, the bigger system serves as 
a heat bath for the small system and the two systems equilibrate at the temperature of 
the bath, which does not change during the equilibration process. The situation changes, 
however, when the two systems are comparable to each other, so that both systems are 
affected by the heat exchange. Assuming that the energy flow between the two systems 
is much slower than the characteristic relaxation time of each of the systems, and that 
the total energy Etot is not fluctuating, we can again use the Fokker-Planck formalism 
and the analysis in the previous example. The only difference is that, instead of 13(E), 
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we need to use Al3{E) = (3i{Ei) — (3ii{Eii), where /3i and /3ii are the temperatures of 
systems I and II, respectively, and Ei and -Bn are their respective energies. The latter 
satisfy the constraint Ei + En = .Btot = const. Then, instead of Eq. (260), we find the 
following expression for the energy fluctuations in system I; 




^(Ei> 


'(EOe JE{E'Y[l5i{E')-^n{E,ot-E')] 


dE' 


(262) 

where Je is the rate of the heat flow into system I. This equation describes the evolution 
of the width of the energy distribution of system I. As the system equilibrates, one 
expects that 


Je = C{E-EP), (263) 

where E^'^ is the equilibrium steady-state mean energy of system I, for a given total 
energy -Etot- Likewise 


(3i{E') - Ai(Etot - E') ~ ^ + 


a 


I,c 


a- 


(E'-Ef^), 


(264) 


II,c 


where ctj ^ = —dEi/d/3i is the variance of the energy distribution in the canonical ensem¬ 
ble of system I with mean energy E^^ and, similarly, iTjj ,, is defined for system II. The two 
expressions above ensure that, in the steady state, when Ei = E^"^, the (average) heat 
flux is zero and the temperatures of the two subsystems are identical. By substituting 


this expansion in Eq. (262), we find the asymptotic result for the energy fluctuations in 


system I after equilibration: 


^2 _ ^2 
CJl — (Til 




a 


II,c + 


(265) 


If one of the systems, say system I, is much smaller than the other, then this result 
simply implies that, after equilibration, the energy fluctuations in both systems are given 
by the canonical energy fluctuations of the smallest system. If the two systems are 
identical, then the energy fluctuations in either one of the systems are equal to one half 
of the canonical ones. 


Equation (262) is, however, more general and can be used to study the full evolution of 


the distribution as the systems equilibrate (and not just the approach to the asymptotic 
result). Again, following Ref. |214j . let us consider a specific example of a gas of hard 
spheres in a box. They are simulated using an event-driven molecular dynamics |227j . 
The gas has Ni particles of mass mi and Nu particles of mass mu, all of equal size. 
These groups of particles represent the two systems I and II. This setup is similar to 


the Lorentz gas analyzed in Sec. 7.1, which in turn is identical to a single particle in a 
chaotic cavity. The difference is that here the two masses are both finite while, in the 
Lorentz gas analyzed earlier, one type of atoms was infinitely heavier than the other. 
It is straightforward to check studying the collision between two particles that, if the 
two masses are very different, the energy transfer in each collision is small. In this case, 
a significant energy transfer occurs only over many collisions. Numerical simulations 
were repeated for many runs to evaluate the width of the distribution as a function 
of the average energy. In addition, during the evolution, the energy transfer between 
the two systems was evaluated and Eq. (262) was used to compute the width of the 
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Figure 28. Equilibration of a system consisting of 50 particles with two different masses confined in a box. We 
plot results for (Ei) vs (dots), and compared them to the theoretical prediction in Eq. l |262[ | (solid line). Inset: 
Energy Ei{t) in a single run (solid line), and the average energy {Ei){t) (dashed line). The particle numbers 
used in the simulations are Ni = 30 and Nn = 20. The initial velocities are sampled from a Maxwell-Boltzmann 
distribution with very different initial temperatures: /3i = 60 and /3n = 3. The total energy constraint is enforced 
by a (small) rescaling of the velocities of the mu-particles. The masses are chosen to be mj = 10“^^ and mu = 1 
(in arbitrary units). The box is a unit cube with reflecting boundaries, and the added volume of the particles is 
taken to occupy a 5% of the volume of the box. 


distribution. The results, shown in Fig. 28, indeed confirm the predictions of the Fokker- 
Planck derivation based on the Einstein relation for the open systems (187). 


8. Integrable Models and the Generalized Gibbs Ensemble (GGE) 


One of the focuses of this review has been understanding what happens in isolated non- 
integrable quantum systems that are taken far from equilibrium by means of a sudden 
quench. We have discussed the relaxation dynamics of physical observables and their 
properties after relaxation. We explained that quantum chaos, through eigenstate ther- 
malization, is the reason behind thermalization in those systems. In this section, we 
briefly discuss what happens in integrable systems. Such systems do not exhibit eigen¬ 
state thermalization. 

We note that the very dehnition of quantum integrability is a topic of debate (see, e.g.. 
Ref. [228H23(j] ). but we will not touch upon that here. A rather standard definition of 
quantum integrability, based on the existence of an extensive number of local operators 
(or, more precisely, operators that are extensive sums of local operators) Ik that commute 
with the Hamiltonian and with each other, will be sufficient for the discussion here. The 
requirement that the conserved quantities are local/extensive is essential and it excludes 
the projection operators to the eigenstates of the Hamiltonian. In fact, for any quantum 
Hamiltonian, integrable or not, the projection operators to its eigenstates commute with 
the Hamiltonian and with each other, that is, they are conserved, but they are neither 
local nor extensive. As we have argued for quantum chaotic systems, the existence of 
those conserved quantities does not preclude the thermalization of physical observables. 
Exactly the same can be said about higher moments of the Hamiltonian, which are 
separately conserved but play no role both in equilibrium thermodynamics and in the 


thermalization of chaotic systems (see, e.g., the discussion in Sec. 4.3.2) 


8.1. Constrained Equilibrium: the GGE 

The main difference between chaotic and integrable systems becomes apparent already 
at the single-particle level, see Eig. In the classical chaotic billiard any trajectory, after 
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some time, uniformly fills the available phase space at a given energy. As a result, the 
long-time average and the microcanonical ensemble average of an observable agree with 
each other. In contrast, in the integrable cavity the particle’s motion is constrained by 
other conserved quantities and the time average and the microcanonical ensemble average 
need not agree. Nevertheless, the particle might still uniformly hll the available phase 
space. This means that the long-time average could still be described by an ensemble 
average, but it needs to be a generalized microcanonical ensemble that accounts for all 
conserved quantities in the system |231j . 

In the quantum language, this amounts to saying that, in order to describe time av¬ 
erages of observables in integrable systems, one needs a constrained ensemble which is 
built using eigenstates of the Hamiltonian involved in the dynamics (they are selected 
by the initial state). As in classical systems, in which conserved quantities preclude the 
exploration of all phase space, the failure of integrable quantum systems to exhibit eigen¬ 
state thermalization can be traced back to the fact that they have an extensive number 
of nontrivial (local/extensive) conserved quantities {Ik}, as opposed to the 0(1) num¬ 
ber of extensive conserved quantities in nonintegrable systems (energy, momentum, etc). 
Despite the existence of the conserved quantities {Ik}, and because of dephasing (like 
in nonintegrable systems), observables in integrable systems are expected to relax to 
stationary values and remain close to those values at most later times. 

Remarkably, in Ref. 1M2], the previous statements were shown to hold for an inte¬ 
grable model of hard-core bosons. Instead of a generalized microcanonical ensemble, a 
generalized grand canonical one was introduced in that work. It is now known as the 
GGE, whose density matrix: 


Pgge = 


exp(-Efc^fc4) 

Tr[exp(- Afc4)] ’ 


(266) 


was obtained by maximizing the entropy [23311234j under the constraints imposed by 
conserved quantities that make the system integrable. The values of the Lagrange mul¬ 
tipliers were determined by requiring that, for all k’s, Tr[pGGE4] equals the expectation 
value of Ik in the initial state. It is a priori not obvious that the exponential form in 
Eq. ( 266[ ) is warranted. For extensive integrals of motion, one can justify the exponential 
form in the same way as it is done in traditional statistical mechanics, namely, noting 
that: (i) because of the equivalence of ensembles for subsystems, which is a direct conse¬ 
quence of the extensivity of the conserved quantities, the precise form of the distribution 
is not essential, and (ii) the exponential distribution leads to statistical independence of 
subsystems, which is naturally expected after relaxation in a system governed by a local 
Hamiltonian. 

The fact that observables in integrable systems do not, in general, relax to the same 
values seen in thermal equilibrium, and that the GGE describes few-body observables 
after relaxation, has been verified in a large number of studies of integrable models. These 
can be either solved numerically for much larger system sizes than those accessible to 
full exact diagonalization or analytically solved in the thermodynamic limit |131l 11331 
12321 l235H255j . We should emphasize, however, that special initial states may still lead to 
nearly thermal expectation values of observables in integrable systems after relaxation 
[1361113711145L123611256H259] . Hence, finding nearly thermal results for some initial states 
does not automatically mean that a system is nonintegrable. In what follows we discuss 
several examples for which the GGE can be used. These will highlight the reasons for its 
applicability. 
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8.1.1. Noninteracting Spinless Fermions 

Noninteracting systems are possibly the simplest class of integrable systems. Let us 
discuss one particular noninteracting system that clarifies some important features of 
the GGE introduced above. We focus on noninteracting spinless fermions in a one¬ 


dimensional lattice (relevant to the hard-core boson system discussed in Sec. 8.1.2) 


L-l 




UiU 


3 


(267) 




i=i 


where /| (/j) is a fermionic creation (annihilation) operator at site j^fij = fjfj is the site 
j occupation operator, J is the hopping parameter, and Uj are arbitrary site potentials. 

The single-particle Hamiltonian (267) can be straightforwardly diagonalized: 
HsF 7 fc| 0 ) = where are the single-particle eigenenergies, \k) = yjjlO) are the 

single-particle eigenstates, and k = 1,2,...,L. The occupations of the single-particle 
eigenstates = 7 ]!7;. immediately form a set of L nontrivial nonlocal conserved quan¬ 
tities for a system consisting of many noninteracting spinless fermions. These conserved 
quantities are not extensive. However, carrying out the GGE analysis for this set of 
conserved quantities, imposing that Tr[pGGE%] = = vIj finds that the 

Lagrange multipliers are given by the expression [232j 


Afc = In 


vi 


(268) 


that is, the Lagrange multipliers are smooth functions of 


In Eig. 29, we show results for r/(, and Xk after a quench in which a superlattice potential 


Uj = in Eq. (267)] is turned off. In this quench the initial state is taken to be the 


ground state for u/ y 0 and the time evolution is carried out under a final Hamiltonian 
with u = 0. An important feature, made apparent by the results in Eig. 


29 


is that 

increasing the system size by a factor of 10 leads to essentially the same curve for rjl 
vs k but with 10 times the number of data points. This smooth dependence of A on 
k' = k/L allows one, for sufficiently large system sizes, to define extensive integrals of 
motion by taking the sum Yjk'&[k"-5k"/ 2 ,k"+ 5 k"/ 2 ] ^k'Vk’ as being equal to Xk"fjk" where 
now = Yl,k'&[k"-&k"/ 2 k"+&k"/ 2 ]'^k' is extensive. Hence, the mode occupations % can 
be thought of as being extensive in a coarse-grained sense and the justification of the 


GGE exponential form [see Eq. (266)] presented above remains valid |131l 11331 I136| . 
The formal equivalence between the GGE constructed using occupation modes (as done 
here) and using extensive conserved quantities was established in Ref. |252j . It extends 
beyond noninteracting systems to integrable models that may or may not be mappable 
to noninteracting ones. 

Having justified the applicability of the GGE to the occupation modes of the single¬ 
particle Hamiltonian of noninteracting many-particle systems, one can go a step further 
and prove that the GGE defined this way provides exact results for the time average 
of all one-body observables (without finite-size errors) |132l I260j . The proof, following 
Ref. |132j . is straightforward. Projecting the many-body time-evolving wave function 
p{t) = |?/)(t))(V’(t)| onto the one-body sector, and using the fact that all eigenstates of 
the many-body Hamiltonian are (antisymmetrized) direct products of the single-particle 
states |A:), the time evolution of the one-body density matrix can be written as 


/5ob(t) = 


(269) 


k,k' 
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k/L k/L 

Figure 29. (a) Expectation value of the conserved quantities in quenches from uj ^ 0 (as indicated in the figure) 

to u = 0. (b) The corresponding Lagrange multipliers. The conserved quantities are ordered such that increases 
with increasing k. The results denoted by symbols (lines) correspond to systems with L = 38 (380) sites at 
half-filling (A^ = -^/2)- Adapted from Ref. | 136 |. 


In the absence of degeneracies in the single-particle spectrum, the infinite-time average 
of Poh{t) can be written as 


_ rt' _ 

/5ob(t) = lim - / dtpohit) = '^r]l\k){k\, (270) 

t'^OO t' Jq ^ 

which is, by construction, the one-body density matrix within the GGE, as rjl = 
En K^lV’i)P%,n = tilpGGEVk], where r]k,n = 1 (%,n = 0) if the single-particle state 
\k) is (is not) part of the particular many-body state |n), and Tr[/iGGE7^7^/] = 0 for 
k ^ k'. We should emphasize at this point that this does not mean that all one-body 
observables equilibrate at their GGE values (they do not, see Refs. |132( I2.'I8| I255( [26n] ). 
but simply that their time average is given by the GGE prediction. 


8.1.2. Hard-Core Bosons 

We now turn our attention to hard-core bosons, described by the Hamiltonian 


L-l 


Hrcb = {^h+^ “*■ 


i=i 


i=i 


(271) 


where S) (bj) is the hard-core boson creation (annihilation) operator at site j, h’j = b^jbj 
is the site j occupation operator, J is the hopping parameter, and Uj are arbitrary site 
potentials. Hard-core bosons satisfy the same commutation relations as bosons but with 
the additional constraint that there cannot be multiple occupancy of any lattice site, i.e., 

ib]? = b] = om- 

The hard-core boson Hamiltonian above can be mapped onto a spin-1/2 chain through 
the Holstein-Primakoff transformation |33L I261j and the spin-1/2 chain onto the spinless 
fermion Hamiltonian in Eq. (267) via the Jordan-Wigner transformation |33L I262j 




m 

£=1 




i-1 

H 

£=l 




fv 




n]. 


(272) 
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Figure 30. (Main panels) Site (a) and momentum (b) occupations in a trapped integrable system of hard-core 
bosons after a quench in which the initial state is the ground state of a trapped system (vj ^ 0) in the presence 
of a superlattice potential {uj ^ 0) and the dynamics is carried out in the presence of the same trap {v = vj) 
but with no superlattice potential {u = 0), see text. Results are presented for the time average of the occupations 
after relaxation, as well as for the GGE and the grand canonical ensemble (GE) predictions. The insets show 
the dynamics of the occupation of site 251 (a) and of the zero momentum occupation (b). The horizontal lines 
correspond to the results in the GGE and grand canonical ensemble, as depicted in the main panels. The system 
has 900 sites and 299 hard-core bosons. Times are reported in units of /l/J, and k is reported in units of 1/a, 
where a is the lattice spacing. See also Ref. 1^ . 


This means that the dynamics of the hard-core boson site occupations is identical to 
that of the fermions, but the momentum distribution function of hard-core bosons, which 
involves a Fourier transform of one-body correlations in real space, is very different from 
that of the fermions. The hard-core momentum distribution function can be efficiently 
calculated for eigenstates of the Hamiltonian [263], for systems out of equilibrium |264j . 
and in the grand canonical ensemble |265j using properties of Slater determinants. The 
conserved quantities to construct the GGE can be taken to be the same as for the 
noninteracting spinless fermions in Sec. 8.1.1[ namely, single-particle mode occupations. 

In Fig. 30, we show results for an integrable model of hard-core bosons in the presence 
of a harmonic trap after a quench in which a superlattice potential is turned off pH] - 
in this case, Uj = v(j — T/2)^ -|- u{—iy, where v and u set the strength of the harmonic 
trap and the superlattice potential, respectively. In the main panels, one can see the time 
average of the site (a) and momentum (b) occupation profiles after relaxation. They are 
clearly different from the predictions of a grand canonical ensemble for a system whose 
Hamiltonian is the one after the quench. The temperature and chemical potential of the 
grand canonical ensemble are fixed so that the mean energy and number of particles 
match those of the time-evolving state. We note that the system considered here is 
large enough so that the observed differences between the time-averaged profiles and 
the thermal predictions are not due to finite-size effects |236j . On the other hand, the 
predictions of the GGE, which like the grand canonical ensemble also has energy and 
particle number fluctuations, are indistinguishable from the results of the time average. 
The insets exemplify the relaxation dynamics by depicting the time evolution of the 
occupation of one site [Fig. 


j0[a)] and of the zero momentum mode [Fig. 30 b)]. They 
can both be seen to relax towards, and oscillate about, the GGE prediction. The grand 
canonical ensemble results are clearly incompatible with the results after relaxation. 
The amplitude of the fluctuations about the time average reveals another qualitative 
difference between nonintegrable and integrable systems. In quenches involving pure 
states in isolated integrable systems mappable to noninteracting models (and in the 
absence of localization), time fluctuations only decrease as a power law of the system 
size |131L I245[ I255[ 126011266[ I267j . This is to be contrasted to the exponential decrease 
of the amplitude of the time fluctuations as a function of the system size expected, and 
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seen [185], in nonintegrable systems. In Ref. |185j . it was argued based on numerical 
experiments that the time fluctuations of observables in integrable systems that are not 
mappable to noninteracting ones can also decrease exponentially fast with increasing 
system size. 


8.2. Generalized Eigenstate Thermalization 


One may ask at this point why is it that the GGE is able to describe observables after 
relaxation in isolated integrable systems following a quantum quench. After all, Eq. ( |60| ) 
is still dictating the dynamics and, once eigenstate thermalization does not occur, one 
might expect that the results after relaxation will depend on the exponentially large (in 
the system size) number of parameters Cn = {n\'ipi) that are set by the initial state while 
the GGE depends only on a polynomially large number of parameters. As discussed 
in Ref. m, the validity of the GGE can be understood in terms of a generalization 
of eigenstate thermalization in integrable systems. Namely, if eigenstates of integrable 
Hamiltonians with similar distributions of conserved quantities have similar expectation 
values of physical observables (we call this phenomenon generalized eigenstate thermal- 
izatior^^), then the GGE will describe those observables after relaxation following a 


quench. This can be understood as follows. In the diagonal ensemble (after quenches to 
integrable systems), the fluctuations of each extensive conserved quantity are expected 
to be subextensive for physical initial states - as we showed for the energy in Sec. 4.3.2 


This, together with the fact that the GGE is constructed to have the same expectation 
values of conserved quantities as the diagonal ensemble and combined with generalized 
eigenstate thermalization, is what leads to the agreement between the GGE and the 
results after relaxation. 

Numerical evidence for the occurrence of generalized eigenstate thermalization was 
presented in Refs. for integrable hard-core boson systems (similar to those in 

Fig. 30), and in Ref. [133] for the transverse field Ising model. In addition, for some ob¬ 
servables, the occurrence of generalized eigenstate thermalization was proved analytically 
in Ref. |133j . In what follows, we review results for the transverse field Ising model. 

The relation between the transverse field Ising model and the hard-core boson model 


in Eq. (271) can be understood as follows. The hard-core boson chain can be mapped 


onto a spin-1/2 chain via: 


= sf = il. s- = h,. 


(273) 


After these substitutions, using that = Sj + iSj and Sj = Sj — iSp the Hamilto¬ 


nian (271) reads 


L-l L 

Hxx = -2J +Y.u,[S^ + ^ 

j=i i=i 


'0 \ ‘"3 ' 2 


(274) 


This Hamiltonian is known as the isotropic XY model (or the XX model) in a transverse 
field. Its anisotropic version, for periodic boundary conditions in the presence of a uniform 


®^What changes from nonintegrable to integrable Hamiltonians is that, in the latter, the expectation values of 
few-body observables in any eigenstate are determined by the values of all conserved quantities and not just the 
energy. 
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Figure 31. Density plots of the weights of the eigenstates of the transverse field Ising Hamiltonian [Eq. ( |275| | with 
7 = 1 and J = 1] as a function of their eigenenergies and of the eigenstate expectation values of the next-nearest 
neighbor correlation function Pa^nels (a)-(c) depict the weights in the diagonal ensemble and panels (d)-(f) 

depict the weights in the GGE, in each case for three different system sizes. The initial state is the ground state 
for hj = 0.1, and after the quench h = 1.5. Black pixels mark the presence of eigenstates (with vanishing weight), 
while gray pixels signal their absence. Golored pixels show the nonvanishing weights in the diagonal ensemble 
[panels (a)-(c)] and in the GGE [panels (d)-(f)]. These results were provided by Lev Vidmar (see also Ref. [133]). 


field of strength h, can be written as 

L 


Hxy = - J 


(1 + 7 ) 


f=i 




(275) 




i=i 


where we used that 5“ = ( 7 '“/ 2 , and 7 is the anisotropy parameter. In the hard-core 
boson language, 7 7 ^ 0 leads to non-number-conserving terms of the form -|-H.c.). 

In the extreme anisotropic limit 7 = 1, the model in Eq. (275) is known as the transverse 
field Ising model [268j . 

In Fig. we show results for the weights of the many-body eigenstates of the trans¬ 


verse field Ising model (color coded in the scale on the right) as a function of the energy 
of the eigenstates and of the eigenstate expectation values of Hi® lop panels, 

we show weights in the diagonal ensemble and in the bottom panels we show weights 
in the GGE, in each case for three different system sizes. The black regions mark the 
existence of eigenstates with the corresponding eigenenergies and eigenstate expectation 
values, but with vanishing weight in the ensembles. The fact that those black regions do 
not narrow with increasing system size (they can be seen to slightly widen) is to be con¬ 


trasted to the results in Fig. 13 for nonintegrable systems. The contrast makes apparent 
that in the transverse field Ising model eigenstate thermalization does not occur. 

More remarkably. Fig. shows that the eigenstates of the Hamiltonian with a sig¬ 
nificant weight in the diagonal ensemble and the GGE are located in approximately the 
same small region in the plane defined by the eigenstate energies and expectation values 
of (see Ref. |133j for similar results for other observables). In Ref. |13,3| . it was 

shown that, in both ensembles, the width of the energy distribution and of the distri- 
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bution of expectation values of vanishes as IjyfL with increasing system size. If 

one adds to this finding the fact that, in both ensembles, the mean value of the energy 
is the same by construction and the expectation value of is found to agree, as 

in previous examples, one concludes that the eigenstates of the hnal Hamiltonian that 
determine the results in the diagonal ensemble and in the GGE in the thermodynamic 
limit are located at the same point in the aforementioned plane. Generalized eigenstate 
thermalization is reflected by the fact that the width of the distribution of eigenstate ex¬ 
pectation values vanishes with increasing system size. These results make apparent that 
the exact distribution of weights in the diagonal ensemble and the GGE is irrelevant, the 
overwhelming majority of the states they sample have the same expectation values of 
the observable. This is why the GGE can predict the expectation values of observables 
in integrable systems after relaxation following a quench, even though the number of 
parameters required to construct the GGE increases polynomially with the system size 
while for the diagonal ensemble it increases exponentially with the system size. In this 
spirit, in Ref. |249| it was argued that a single representative state is sufficient to describe 
the relaxed state of integrable systems after a quench in the thermodynamic limit. This 
statement is indeed very reminiscent of ETH for nonintegrable systems. 


8.2.1. Truncated GGE for the Transverse Field Ising Model 


As for hard-core bosons, for the transverse field Ising model in Eig. 31 the GGE was 
constructed using occupations of single-particle fermionic quasiparticles (Bogoliubov 
fermions) |133j . Alternatively, one can construct a different equivalent set of integrals 
of motion, which are explicitly local and extensive. Following Ref. |252j . these integrals 
of motion can be ordered according to their locality and come in pairs and , such 


that contains sums of products of up to A:-|-2 neighboring spin operators [252]. They 
can be written as 




n = - 


•^E(« 


’ij+2 ) 


'•E('S5+i + 'S”+.). 


-'n>2 ~ 


JEf'SS+wi+'5S+«-i) - '=E('S5+«+'SS+»). 


v = - 




(276) 




where 


An important question one might ask is, given a desired accuracy for some observable, 
how many conserved quantities are needed for the GGE to describe the result after 
relaxation. On this point, the locality of the observable and of the conserved quantities 
included in the GGE turn out to be crucial. For the model above. Fagotti et al. showed 
numerically that if one is interested in the reduced density matrix of a subsystem of size 
L', then only the first L' integrals of motion: with k < L', that is, the integrals 

of motion that can “fit” on the subsystem, are important |252| . All other “less local” 
integrals of motion have an exponentially small effect on the subsystem (see Fig. 32). 
This is expected to be generic in integrable models, whether they are mappable or not 
to noninteracting ones. 
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in the truncated GGE obtained by imposing local conservation laws with support in at most y + 1 consecutive 
sites, for a particular quench in the transverse Ising model. The subsystem size ranges from / = 5 to / = 50. The 
color and size of the symbols in the figure change gradually as a function of the subsystem size 1. For y > the 
norm distance decays exponentially in with an /-independent decay constant. From Ref. 1^ . 


8 . 3 . Quenches in the XXZ model 

An area of much current interest within the far from equilibrium dynamics of integrable 
systems is that of quenches in models that are not mappable to noninteracting ones. 
One model in this class, which is particularly important due to its relevance to exper¬ 
iments with ultracold bosons in one-dimensional geometries, is the Lieb-Liniger model. 
Studies of quenches within this model, in which repulsive interactions were suddenly 
turned on, revealed that the expectation values of conserved quantities diverge. As a 
result, a straightforward implementation of the GGE is not possible |269t 127(1] . A lattice 
regularization for this problem was discussed in Refs. |254t I269| (generalized eigenstate 
thermalization was argued to occur in this regularized model |254) ). Despite progress in 
constructing GGEs for field theories |271) . an explicit construction of the GGE for the 
Lieb-Liniger is still lacking. 

Another model that has attracted much recent interest, and which is the focus of this 
subsection, is the XXZ model 


Hxxz = —J ^ 
i=i 




+ Aa^d: 


i^3+^ 


(277) 


This model is, up to a possible boundary term, mappable onto the models in Eqs. (40) 


and (55) when J' = V = 0. Studies of quenches in the XXZ model revealed that the 


GGE constructed using all known local conserved quantities at the time failed to describe 
few-body observables after relaxation |272H275] . In Fig. [SSfa), we show results for the 
next-nearest neighbor correlation (df (t|) in this model after a quench from an initial Neel 
state to a finite value of the anisotropy parameter A [equivalent to Vj (2 J) in Eq. (55) for 
J' = V' = Q]. The results expected for that correlation in the steady state, which were 
obtained in the thermodynamic limit using Bethe ansatz, ((df^d|)sp, continuous line) 
are almost indistinguishable from the GGE results ((dfd|)GGE) dashed line). However, 
plotting the relative difference h((Tfd|) = ((dfdDGGE — (dfcf|)sp) / |(dfd|)sp|, see the 
inset in Fig. |33[a), reveals that they are not identical. The differences are largest close 
to the isotropic Heisenberg point. Near that point, calculations using numerical linked 
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Figure 33. Quenches in the XXZ chain in the thermodynamic limit, (a) Next-nearest neighbor correlations 
following a quench starting from a Neel state to a finite value of the anisotropy parameter A. Results 
are reported for the expected steady state obtained using Bethe ansatz (solid line) and for the GGE prediction 
(dashed line), (b) Gomparison between the steady state (d^dDsp, the GGE (dJdDGGEi NLGE for the 

diagonal ensemble (d|^d|}NLCE results close to the isotropic point. The GGE results are seen to depart from the 
others. Error bars in the NLGE data display an interval of confidence. Inset in (a): Relative difference between the 
steady-state Bethe ansatz result and the GGE, 5{a^a^) = ((d|^d|)GGE ~ (^i^ 3 }sp) / | (‘^i'^ 3 }sp|• From Ref. 


cluster expansions (NLCEs) for the diagonal ensemble [135j after the same quench [276] 
agree with the steady-state predictions obtained using Bethe ansatz, see Fig. 33 b). The 
discrepancy between the GGE results and the others suggested that, given the set of 
conserved quantities selected, generalized eigenstate thermalization did not occur in this 
model (in Ref. |277j . it was argued that it fails for integrable models that support bound 
states). Hence, other local conserved quantities (not known at the time) were expected to 
also be important. The extra conserved quantities needed were recently found |278| 1279] 
and the GGE constructed using them has been shown to describe the steady state of 
observables after relaxation following the quench for the XXZ model described above 

It is important to emphasize at this point that we expect generalized eigenstate ther¬ 
malization to be a generic phenomenon in integrable systems (as eigenstate thermal¬ 
ization is in nonintegrable systems) and that, as a result, GGEs allow one to describe 
observables in integrable systems after relaxation. However, in contrast to nonintegrable 
systems in which the conserved quantities are trivial to find and, as a result, traditional 
statistical mechanics can be used almost as a black box, the same is not true in integrable 
systems. For the latter, a careful analysis needs to be done (specially for models that 
are not mappable to noninteracting ones) in order to identify the appropriate conserved 
quantities needed to construct the GGE. 

In all quenches discussed so far for integrable systems, mappable or not mappable 
to noninteracting ones, the initial states were taken to be eigenstates (mostly ground 
states) of an integrable model. One may wonder whether the lack of thermalization we 
have seen in those quenches is a result of the special initial states selected. In order to 
address this question, NLGEs were used in Ref. |13nj to study the diagonal ensemble 
results (in the thermodynamic limit) for observables in quenches to the J' = V' = ^ 
hard-core boson model in Eq. (55), which is the XXZ model (277) written in the hard¬ 
core boson language. The initial states for those quenches were taken to be thermal 
equilibrium states of Hamiltonian (55) for A = = V/ y 0, that is, thermal equilibrium 

states of a nonintegrable model. Those are the kind of initial states that one expects to 
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Figure 34. NLCE results after quenches in which the initial state is a thermal equilibrium state of the nonin- 
tegrable Hamiltonian l |55[ l with A = Jj = Vj ^ 0 and the system is quenched to J' = V' = 0, that is, to the 
integrable XXZ model written in the hard-core boson language. J = V = 1 remains unchanged during the quench, 
(a) Relative entropy differences 5Si [see Eq. | |278[ l] and (b) relative momentum distribution differences Smi [see 
Eq. ( |279| |] vs the order I of the NLCE for the diagonal ensemble. Results are shown for Tj = 2 and six values of 
A. SSi and 5mi for A = 0, that is, in the absence of a quench, decrease exponentially fast with the order I of the 
NLCE. All other differences saturate to a nonvanishing value reflecting lack of thermalization. Similar results were 
obtained in Ref. [SQ] for other initial temperatures and observables. Adapted from Ref. [130]. 


have usually in experiments. 

In Fig. [Ml we show results for the relative entropy differences 


SSi 


cGE qDE 
*^18 


cGE 

*^18 


(278) 


between the grand canonical ensemble (GE) and the diagonal ensemble (DE) predictions, 
and the relative momentum distribution differences 


6mi 




(279) 


also between the GE and the DE predictions, plotted as a function of the order I of the 
NLCE for the diagonal ensemble. The initial states were taken to have a temperature 
Tj = 2J (the results for other initial temperatures are qualitatively similar [13U| i. After 
the quench, the temperature and chemical potential in the grand canonical ensemble 
were hxed so that the mean energy and number of bosons per site agree (up to machine 
precision) with those in the diagonal ensemble. The NLCE was carried out up to order 
I = 18. 5^^ and were checked to be converged to the thermodynamic limit result 

up to machine precision (see Ref. m for details). 

The results for 5Si and 5mi in Eig. are qualitatively similar. For A = 0, that is, 
in the absence of a quench {Jj = Vj = J' = V = 0), one can see that 6Si and 6mi 
vanish exponentially fast with increasing I, i.e., the convergence of the NLCE expansion 
to the thermodynamic limit result is exponential in 1. However, as soon as A 7 ^ 0, that 
is, as soon as there is a quench, the differences saturate to a nonzero valueThis means 
that the entropy and the momentum distribution function in the diagonal ensemble are 
different from their grand canonical counterpart in the thermodynamic limit. This is 
to be contrasted with the opposite quench, from integrable to nonintegrable points, for 
which the numerical results are consistent with vanishing dSi^oo and dmi^oo USD]. For 
quenches to the integrable point, the fact that SSi^oo 7 ^ 0 [as suggested by Fig. [34|^a)] 


^^The quantitative difference between the values at which each relative difference saturates is related to the 
fact that SSi^cc, oc A^ while 5mi^^ oc A. This is something that was argued analytically and demonstrated 
numerically for SSi=is in Ref. |13Q| . but that it is not important for the discussion here. 
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means that the energy distribution is not a smooth Gaussian function (or else 5Si^co = 0; 


see Sec. 5.3.1). Hence, the sparseness of the energy density appears to be a generic feature 


in physically relevant quenches and not a consequence of specially fine-tuned initial states. 
This is why, after the quench, observables such as the momentum distribution function 
do not thermalize 7 ^ 0 as suggested by Fig. 34'b)]. The latter phenomenon 

also appears to be generic. These results highlight how careful one needs to be when 
using typicality arguments |142H144] . Those arguments might lead one to conclude that 
a fine-tuning of the initial state is needed for integrable systems not to thermalize after 
a quench, while this appears not to be the case in physically relevant situations. 


8 . 4 . Relaxation of Weakly Non-Integrable Systems: Prethermalization and 
Quantum Kinetic Equations 


Integrable systems are unlikely to be found in nature. Nevertheless there are many ex¬ 
amples of models which are nearly integrable, where the integrability breaking terms 
are irrelevant for relatively long times. As early as in 1834, Scott Russel observed the 
soliton created by a boat in a narrow canal near Edinburgh |280j . In our language, the 
solitary wave is an example of a macroscopic non-thermalizing perturbation. It was not 
until 30 years later that it was realized that this phenomenon can be attributed to the 
integrability of the Korteweg-de Vries (KdV) equation, which approximately describes 
water waves in narrow one-dimensional channels |281j . Since the KdV equation only pro¬ 
vides an approximate description of the problem, one can expect that after long times 
the soliton will decay and the system will thermalize. Similarly, in a recent experiment 
with ultracold atoms [38], the lack of thermalization of the one-dimensional bosonic gas 
was attributed to the integrability of the Lieb-Liniger model that quite accurately de¬ 
scribes those systems [33]. Like the KdV equation, the Lieb-Liniger model provides only 
an approximate description of the experimental system and there are various integra¬ 
bility breaking corrections that need to be taken into account at long times (see, e.g.. 
Ref. [282]). 


For nearly integrable systems, one can naturally expect a relatively fast relaxation 
to an approximate steady state determined by the integrable model, and then a much 
slower relaxation to the true thermal equilibrium. Such a scenario is now known under 
the name of prethermalization. This term was introduced by Berges et al. in the con¬ 
text of cosmology [283] . though the ideas of multi-time thermalization are much older. 
Recently, several different prethermalization scenarios have been explored both theoret¬ 
ically and experimentally. Just to name a few: relaxation of weakly interacting fermions 
after an interaction quench |183L 12371 I284H287] . prethermalization plateaus in various 
one-dimensional nonintegrable systems [391 I288H293] . prethermalization in interacting 
spin systems [294] . two-dimensional superfluids with slow vortices and other topological 
defects |295It297| . prethermalization after turning on a long-range interaction in a spin¬ 
less Fermi gas in two dimensions [298], the emergence of nonthermal fixed points, and, in 
particular, the emergence of turbulence |29911301] . On the latter, it is actually interesting 
to note that a GGE based on momentum occupation numbers (for the limit of weakly 
interacting particles) can be used to explain Kolmogorov’s law [299] . 

In few-particle classical systems, the KAM theorem ensures that chaotic motion does 
not appear immediately after one breaks integrability. Instead, one can have coexis¬ 
tence of regions of chaotic and regular motion. As the strength of integrability break¬ 
ing perturbation increases, chaotic regions spread and eventually occupy all available 
phase space (see, e.g.. Fig. Al). It is not known whether a similar scenario is realized 


in many-particle systems. If situations like that exist, so that an extensive number of 
integrals of motion can survive small integrability breaking perturbations then, instead 
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of prethermalization, one can anticipate relaxation to a GGE defined with respect to 
deformed integrals of motion. Such deformations have been discussed in the literature 
for transitionally invariant integrable systems with small integrability breaking perturba¬ 
tions [MllMlBns], and, in the context of many-body localization, for weakly interacting 
disordered systems At the moment, it is unclear whether in non-disordered 

extended systems (either classical or quantum) in the thermodynamic limit there can 
be a finite threshold for ergodicity breaking. Thus, it is not clear whether the emerged 
deformed GGEs can only describe transient states (although potentially long lived) or 
can represent true steady states. The former scenario is probably more generic but we 
are not aware of any strong evidence for it. 

Arguably, the most successful approach for describing relaxation of weakly inter¬ 
acting (i.e., weakly nonintegrable) systems to equilibrium is the kinetic theory (see, 
e.g.. Ref. [US]). Recently, Stark and Kollar [306] derived kinetic equations using time- 
dependent perturbation theory applied to the GGE. These equations describe the relax¬ 
ation from a prethermalized GGE to a thermal state. Below, we closely follow that work, 
extending it to arbitrary integrable systems. 

Let us assume that we have an integrable system described by the Hamiltonian Hq 
and a weak integrability breaking perturbation H' 

H = Hq + H\ (280) 


The Hamiltonian Hq commutes with a set of mutually commuting linearly independent 
integrals of motion Ik, i.e., [Ho,Ik] = 0. Eor example, in the spirit of what was discussed 
in Sec. 8.1.1, in a system of interacting fermions or bosons, these integrals of motion can 


be the occupations of the single-particle eigenstates. 

Let us now assume that the system is prepared in some nonequilibrium initial state, for 
example, by a quantum quench, and we are interested in its relaxation. If H’ is a weak 
perturbation of Hq, then, at short times after a quench, the system “does not know” that 
it is nonintegrable and the effect of H' on the dynamics is small and the system relaxes 
to an appropriate GGE, possibly described by deformed integrals of motion of Hq. This 
was found to be the case numerically in several systems (see, e.g.. Refs. [29211306] .! At 
long times, H' is expected to lead to relaxation to thermal equilibrium. 

Since H' is assumed weak compared to Hq, the dynamics generated by H' is slow 
compared to the dynamics generated by Hq. This time scale separation translates into 
the fact that, at each moment of the evolution, the system is approximately stationary 
with respect to Hq so that it can be described by a GGE with slowly evolving Lagrange 
multipliers plus a small correction Sp{t). This leads to the following ansatz for the density 
matrix of the system: 


Kt) = PGGE{t) + 6p{t), 


PGGE{t) = 


exp(-Efc^fc(t)4) 

Tr[exp(-X;fc-^fc(i)4)] 


(281) 


If the system thermalizes, one expects that the Lagrange multiplier associated with the 
energy approaches the inverse temperature (3, while all others approach zero. 

Since H' is small, it is convenient to go to the interaction picture with respect to Hq: 

pj{t) = e-*^“*p(t) e*^“*, H'{t) = (282) 
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where von Neumann’s equation becomes: 

idtpi{t) = [H'{t),pi{t)]. 


(283) 


Our strategy will be to solve the von Neumann equation (283) using time-dependent 


perturbation theory with the GGE ansatz as the initial condition. This will allow us to 
find small changes in the expectation values of the integrals of motion {Ik)^ which in 


turn define the Lagrange multipliers in the GGE (281). In this way, the slow evolution 
of Pgge is determined self-consistently: 


dt{ik) = dt (Tr[4p/ 


= Tr 


= iTr 


H' 


(284) 


where we used Eq. (283), and that Ik is commute with Hq and thus remain time inde¬ 


pendent in the interaction picture. To leading order of perturbation theory in we 

have pi{t) K, pgge and therefore 


dt{Ik) ~ *Tr (Ik Pgge, H'{t) 


= 0 . 


(285) 


The last equality follows from the cyclic property of trace and the fact that pgge and 
Ik commute. Therefore, we have to go to the next order of perturbation theory: pi{t) ~ 


Pgge + dpi{t), where from Eq. (283) 


rt—to 


6pi{t) RS i f dt'lpGGE, = i I drlpGGE, &'{t - r)]. 

tn 0 


(286) 


Here, to is some arbitrary time in the past. By substituting this correction to Eq. (284), 
we obtain 


dt{ik)=iT^T(ik[ypi{t),H' 


rt-tn 


dTTll^Ik pGGE,H'{t-T) ,iL'(t) ^ 


rt-tn 


dr {[[Ik, H'{t-T)],H\t)]) GGE, (287) 


where, once again, we have used the cyclic property of the trace and the fact that 
Pgge and Ik commute. To simplify this expression further, we note that the correlation 


functions appearing in Eq. (287) depend only on time differences. Also, because the 


relaxation of nearly conserved integrals of motion is very slow compared to the time 
scales set by Hq, the correlation functions appearing in the integral above decay fast so 
one can take the limit t — to ^ oo. After these simplifications, one obtains the desired 
quantum kinetic equations for the integrals of motion 


dt{ik)-- / dt{[[Ik,H'{0)],Il'{t)])GGE. 


(288) 


Both the expectation value on the LHS and RHS of the equation above can be written in 


terms of the Lagrange multipliers specifying pgge [see Eq. (281)]. Solving these equations, 
it is possible to determine the evolution of the Lagrange multipliers and therefore the 


note that the GGE density matrix is not affected by the transformation to the interaction picture. 
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slow relaxation of the GGE to the thermal equilibrium state. Being a set of coupled 
scalar equations, Eq. (288) is much simpler than the original von Neumann equation. 

It is instructive to rewrite the kinetic equations using the Lehman representation in 
the basis of Hq. Using the identity 


POO 

/ = 7r6(en - Cm) + V- 

Jo ' 


, (289) 

and, for simplicity, assuming that both the Hamiltonian and the integrals of motion are 


as 


real, we rewrite Eq. 

dt{Ik) = - p^^^){n\ik\n)\{n\H'\m)\‘^d{e'i - e^). 


(290) 


where is the eigenvalue of Hq corresponding to eigenstate |n), i.e., Holn-) = 
this form, it becomes clear that the thermal distribution (where pnn is only a function 
of energy) is a stationary solution of these kinetic equations, i.e., dt{Ik) = 0 for any Ik- 
Also, the delta function of (e^ — e^) ensures that dt{Ho) = 0. So relaxation to thermal 
equilibrium occurs in the presence of energy conservation. Both properties are, of course, 
expected from general considerations. 


Let us now apply the kinetic equation (287) to a common setup dealing with a gas 


of weakly interacting particles, bosons or fermions. For simplicity, we assume that they 
are spinless. Also, to shorten notations, we will use a scalar notation for the momentum 
modes, keeping in mind that this can be a vector index. Then the Hamiltonian reads 


Hn = 


^kcick 


(291) 


For the integrability breaking term, we take the usual (normal ordered) density-density 
interactions 


= (b = Y 

,^2 5^3 


M,k3,ki Cfca ^ki 


(292) 




For translationally invariant interactions, Vk^^k 2 ,k 3 ,k 4 . is nonzero only when ki + k 2 = 
fca + k^, and it depends only on the transferred momentum q = ki — k^. But, since our 
formalism applies even if translational invariance is broken, we will keep the interac¬ 
tion matrix element in the most general form. The obvious integrals of motion are the 
momentum occupation numbers hk = c|.c^. Let us first compute the commutator 


[hk^,H'] = 2 


k2,k3,k, 


M,k3,ki Cfca ^ki ^2 ,k3,k' ,kiC^i ; 


(293) 


where we used the invariance of the interaction matrix element with respect to permuta¬ 
tion of ki with k 2 and k^ with ^ 4 . Plugging this into Eq. (288) and using Wick’s theorem, 
which works for any GGE with quadratic integrals of motion, we find 


hk' ~ Ihvr E (hfc'h/j2^fc3^fc4 'klk'nk2^k3^ki)\^k',k2,k3,ki\ T €^2 ^ki)i (294) 


k2,k3,ki 


where hfc = 1 ± Ufc with a plus sign referring to bosons and a minus sign referring to 
fermions and Uk = {hk)- Glassical kinetic equations are obtained by taking the limit 
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nfc <C 1 and effectively replacing by unity. Solving these kinetic equations can be 
tedious, but it is numerically feasible for very large systems. Let us check that the thermal 
distribution is a fixed point of these kinetic equations. For example, for fermions, the 
equilibrium distribution reads 


nk 


1 

1 + exp[/3(efe - n)] 


(295) 


then 


hfc'hfcj '^k2^k3^ki — (1 ^fc2)^fc3^fc4 f^k'^^k^iX ^^ 4 ) 


g/3(efc'+£fc2_ g/3(efc3+efc^—2/i) 




0, (296) 


where the last equality relies on the total energy conservation. With more effort, one can 
show that the equilibrium fixed distribution is the attractor of the kinetic equations. 

This example connects the ideas of GGE as a generic stationary state of integrable 
systems, prethermalized states as slowly evolving GGE states, and the kinetic theory as 
the perturbation theory in time describing the final evolution to thermal equilibrium. As 
previously mentioned, more work is needed to understand the generality of this approach 
and its applicability to systems other than those for which one can take the occupations 
of the single-particle eigenstates to be the nearly conserved quantities. 
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Appendix A. The Kicked Rotor 

In this appendix, we discuss how chaos emerge in the simplest setup, namely, a driven 
single-particle system in one dimension. In the presence of driving there is no energy con¬ 
servation and, as a result, the system is not integrable. The Hamiltonian of the classical 
kicked rotor reads 


„2 ~ 

H{p,x,t) = — — Kcos{x) ^ 5{t — nT). (Al) 

n=—oo 

If one thinks of p as angular momentum and x as the canonically conjugate angle, this 
Hamiltonian describes a freely rotating particle that is periodically kicked at times t = 


no 
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nT. We choose the minus sign in front of K so that the time-averaged Hamiltonian 
reduces to a conventional pendulum with an energy minimum at x = 0 (there is no loss 
of generality as this sign can always be changed by x —)• x -|- vr). For simplicity, in what 
follows we refer to p and x as momentum and position, respectively. 

The equations of motion for the kicked rotor are 

dx , dH dp , dH 

= = E (A2) 

n=—oo 

These equations can be easily integrated between kicks. Let us denote by pn and Xn the 
momentum and the position of the particle, respectively, just before the n-th kick, i.e., 
at time t = nT — e, where e —)> 0. Then the equations of motion result in the following 
recursion relations 


Xn+l = X„ + rp„+l, Pn+1 = Pn - Ksm{Xn). 


(A3) 


These equations provide a discrete map (known as the Chirikov standard map) that 
allows one to uniquely determine the position and the momentum of the particle. If 
nT < t < {n + l)r, then p{t) = Pn+i and x{t) = Xn + Pn+i{tniodT). Note that one can 
confine momentum to any periodic interval of length 27r/T. Indeed, from Eq. (A3), it is 
obvious that shift of the momentum by 27r/T and the coordinate by 27r leaves the map 
invariant. Let us analyze the time evolution that follows from this map. The dynamics 
is determined by the kick strength K, the period T, and the initial values of p and x. 
The map depends only on the product KT so we can set T = 1 and analyze the map as 
a function of K keeping in mind that iL <C 1 is equivalent to the short period limit. 


If iL <C 1 and po <C 1, from Eqs. (A3), one can see that both p and x change very 


little during each period. Hence, instead of solving discrete equations, one can take the 
continuum limit and write 


dx 

dn 


dp d‘^x 

’ p, 7 — = —K sin(x) —)• ——7 ~ —K sin(x) 
dn dn^ 


(A4) 


This equation describes the motion of a pendulum in a cosine potential, which is regular. 
Depending on the initial conditions there are two types of trajectories, corresponding to 
oscillations {p <C K) and full rotations {p ^ K). A careful analysis shows that one does 
not need to assume that p is initially small, the only crucial assumption is that AT <C 1 . 

Next, one needs to check the stability of the obtained trajectories. It might happen 
that, if one includes corrections to the continuum approximation, chaos occurs. However, 
as proved by Kolmogorov, Arnold, and Moser (KAM) |62H64] . this is not the case. As 
mentioned in the Introduction, the KAM theorem states that regular motion is stable 
against small perturbations. Eor the kicked rotor problem, one can check the stability of 


the solution above perturbatively. In particular, Eqs. (A3) can be written as 


Xn+l - 2Xn + Xn-1 = -K sin(Xn). 


(A5) 


If K is small, we can assume that x is a nearly continuous variable. By expanding in 
Taylor series one gets 


d?x 1 d^x 
dn^ 12 dn‘^ 


—K sin(x). 


(A 6 ) 


Erom the unperturbed solution, we see that (at least in the localized regime) the natural 


111 






August 2, 2016 


Advances in Physics Review 


frequency of oscillations is ^/K. This means that, in Eq. (A6), the term with the fourth 
derivative is proportional to K^, that is, it is small when K 1. 

When K is large, the continunm approximation for the map fails, p and x “jump” from 
kick to kick. Since both are determined modulo 27r, one may guess that the motion is 
chaotic. A rigorous analytical proof that this is the case does not exist. Hence, we discuss 
indications for the occurrence of chaos for large values of K by analyzing the stability of 
the hxed points of the map: 


Xn+l = Xn+ Pn+1 = Xn, Pn+1 = Pn - K sin(Xn) = Pn- (A7) 

There are only two possible solutions: = 0, = 0 and pn = 0, = vr. Now, let us 

perturb the trajectories and see whether they remain stable. The linearized equations 
read 


5Xn+l — 2(5Xn + 6Xn-l = —K COs{Xn)SXn = ^K6Xr, 


(A8) 


where the minus and pins signs refer to the fixed points x = 0 and x = tt, respectively. 


In Eq. (A8), one might recognize the eqnation of motion of coupled harmonic oscillators, 


where TAT plays the role of the frequency sqnared. For a harmonic chain, it is standard to 
introdnce normal Fourier modes, i.e, A = exp[ig]. Here, we need to be carefnl because the 
frequency, V ±K, can be imaginary. Because this is a translationally invariant system, we 
seek the solution of the form 6xn+i = X6xn = X^Sxq. Using onr ansatz for the solution, 


Eq. (A8) rednces to a simple quadratic equation 


- (2 T K)X + 1 = 0, 


(A9) 


which has two solutions 


, K / 

Ai,2 = l+y ±y+K+—. 


(AlO) 


Let us analyze these solutions separately for the two hxed points. For x = 0, correspond¬ 
ing to the ” sign, we have two solutions 


K [1^ 

Ai,2 = i-y±y—- a:. (All) 

For 0 < AT < 4, the expression in the square root is negative leading to an imaginary 
contribution to A. In the same range of AT, the absolute value of the real part of A is 
smaller than one. This means that the solution is stable. Indeed, if one introduces a small 
deviation to the stable position then, as the discrete time n increases, that deviation does 
not grow. Moreover, in this range, we can check that 

lA^I = (1 - a:/2)2 + a:-a:V4 = 1 (A 12 ) 

implying that A = exp[ig], as for a harmonic chain. This means that any small deviation 
leads to oscillations around the hxed point. 

If AT > 4, the outcome of introducing a small deviation is completely different. This is 
because now there are two real solutions for A. The solution with the negative sign. 


K 

A 2 — 1 — — 
2 



(A13) 
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Figure Al. Phase-space portrait (Poincare cross-section) of the kicked rotor for different values of the parameter 
K. From left to right, K = 0.5, 0.971635, and 5. Images taken from scholarpedia [3081 . 


satisfies IA 2 I > 1- This means that any small deviation from the fixed point grows ex¬ 
ponentially in time without bound, at least in the linearized regime. This exponential 
growth does not prove that the motion is chaotic, but is a strong indicator of it. The expo¬ 
nent characterizing the rate of growth, ln(A), is called the Lyapunov exponent. In chaotic 
systems with many degrees of freedom, there are many Lyapunov exponents. Typically, 
the largest one determines the rate of divergence of nearby phase-space trajectories. 

The analysis of the other fixed point, with x = tt, is even simpler 

K I ^ 

Ai,2 = 1 + y ± Y + ^- (A14) 

Clearly, for any positive K, there are two real solutions with one larger than one, that is, 
this point is always unstable. This is not surprising since this fixed point corresponds to 
the situation where a mass sits at a potential maximum. It is interesting that if, instead 
of S kicks, one applies a fast periodic drive to the pendulum: K = Kq -h asm(iyt), one 
can stabilize x = vr to be an equilibrium position. This is known as the Kapitza effect (or 
Kapitza pendulum), and can be seen to occur in many physical systems [ 66 l 119711307] . 

In Fig. jAl] we show phase-space portraits of the kicked rotor for different values of K. 
For small values of K (left panel), the motion is essentially regular everywhere except 
in the vicinity of the unstable fixed point x = tt, p = 0. As iL increases, chaotic regions 
gradually cover a larger and larger fraction of phase space. At iF = Kc (center panel), 
with 0.971635 < Kc < 63/64 |309II311| . there is a percolation transition. Isolated chaotic 
regions for K < Kc percolate through phase space for K > Kc- This implies that the 
system can increase its energy without bound. For sufficiently large values of K, chaotic 
regions cover phase space almost entirely (right panel). 

The Chirikov standard map can be quantized. A discussion of the quantum map can 
be found in Ref. [308] . and references therein. 


Appendix B. Zeros of the Riemann Zeta Function 

Here, we discuss a remarkable manifestation of RMT, which highlights a connection 
between CUE statistics and prime numbers. There is no clear understanding of the 
origin of this connection and, at the moment, it represents one of the biggest mysteries 
associated with prime numbers. The Riemann zeta function (/(s) is formally defined (for 
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3^(s) > 1) as 


= (bi) 

n>l 

For other values of s, it is defined by an appropriate analytic continuation through the 
integral: 

1 fOO s—l 

C{s) = ^ dx (B2) 

r(s) Jo - 1 

As proved by Euler in 1859, the Riemann zeta function is related to prime numbers 
(again for 51?(s) >1): 


C{») 


n 

p=prime 


1 

1 — p~^ 


(B3) 


The proof of this result is simple and elegant. Notice that we can construct a function 

his) 


his) - Cis) - ^Cis) - 1 + — + — + . , (B4) 

which is a sum that lacks terms that are inverse powers of integer multiples of 2. Similarly, 
one can construct his) 

hO) = h(s) - = as) (i - ^) (i - ^) = 1 + ^ + ^ + + ■ ■ ■. (B5) 


which is a sum that lacks terms that are inverse powers of integer multiples of 2 and 3. 
Continuing this exercise, and using the fundamental theorem of arithmetic, i.e., that any 
number has a unique decomposition into prime numbers, ones proves that, as n —)• oo. 
Inis) —t 1, and hence proves Eq. 


Equation (B3) allows one to map C(s) onto the partition function of a noninteracting 
harmonic chain in which the frequencies of the normal modes are related to the prime 
numbers. The partition function for a single oscillator is 


^p(/3) = 


1 


n\ = 


1 - exp[-/3a;p] ‘ 


(B6) 


If we associate prime numbers with normal modes, ojp = ln(p), and require that (3 = s, 
then 


Zi/3) = llzpiP) = Cil3). (B7) 

p 

The (complex) zeros of the zeta function are thus the (complex) zeros of the partition 
function of this model. The zeros of the partition function are known as Fisher zeros 
which are closely related to Yang-Lee zeros [313j . Condensation of these zeros near the 
real temperature axis is an indicator of a phase transition [312] . Recently, it was conjec¬ 
tured that concentration of Fisher zeros at complex infinity is related to the ergodicity 
of the system HU!. Physically, the Fisher zeros correspond to the zeros of the Fourier 
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0.0 0.5 1.0 1.5 2.0 2.5 3.0 

normalized spacing 

Figure Bl. Distribution of the spacings between approximately one billion zeros of the Riemann zeta function 
near zero number 10^3 + 17,368,588,794, and the statistics of level spacings in the Gaussian unitary ensemble 
(continuous line). These results were provided by Andrew Odlyzko (see also Ref. [316]). 


transform of the energy distribution function P{E) (closely connected to the Loshmidt 
echo and zeros of the work distribution for the quench problems |31f)j ). Indeed 

P{E) = ^ E exp[-/3i?n]5(i5^n - E) (B8) 

where 

Z(/3) = J]exp[-/?K] (B9) 

n 

is the partition function. Hence 

W{t) = J dEP{E)ex.p[iET] = -iT)En] = (BIO) 


Thus, in the physics language, the complex zeros of the partition function Z((3 — ir) 
correspond to the zeros of the Fourier transform of the energy distribution for a system 
of phonons with normal modes given by the natural logarithm of the prime numbers. 

Riemann’s zeta function has many fascinating properties. One of them is that the non¬ 
trivial zeros of C('S)j i-e., zeros that are nonnegative integers, lie on the line iR(s) = 1 / 2 . 
This conjecture is called the Riemann hypothesis and it remains one of the greatest un¬ 
solved problems in mathematics. By now, it has been checked for the first 10^^ zeros 
[316) . Remarkably, the distribution of the normalized spacings of the zeros of Riemann’s 
zeta function is the same as that of level spacings in the GUE (see Fig. Bl). This agree¬ 
ment hints, as mentioned before, a deep connection between prime numbers and random 
numbers. 


115 














August 2, 2016 


Advances in Physics Review 


Appendix C. The Infinite Temperature State as an Attractor 


To prove that the infinite temperature distribution is an attractor, let us recall that any 
stochastic matrix M or, equivalently, Markov matrix has one eigenvalue Aq = 1, while 
all the other eigenvalues have absolute value less or equal than one, i.e., |Aq,>o| < 1 jl9U| . 
The eigenvalue Aq = 1 clearly corresponds to the steady state of the system while the 
others denote processes where the probability distribution decays to the steady state. The 
left and right eigenvectors corresponding to the eigenvalue Aq = 1 satisfy the relation 

LqIVI = Lq, JVI Rq = Rq. (Cl) 


Note that, by construction, the right eigenvector Rq is the steady-state probability dis¬ 
tribution of the system so that its elements should, by normalization, sum to 1. By the 
conservation of probability, i.e., Mn^m = 1, and by direct substitution, it is easy to 
see that the left eigenvector is given by the constant vector Lq = (1,1, • • • , 1). Note that 
with this choice Lq • -^o = 1- Of course, the right eigenvector Rq depends on the details 
of the Markov matrix M and in general has a nontrivial structure. 


To see this, let us decompose the vector of initial probabilities pnn in terms of the right 


eigenvectors as' 


35 


D D 

^ ^ ^ CaRa — Rq ^ ^ CaRa 

a =0 Q ;>0 


( 02 ) 


where we have used the fact that the coefficients Cq, are determined by projection on the 
left eigenvectors therefore cq = 1. Plugging the expression above 

into the master equation (|106), we obtain 


D 




/ 1 


(A, 


N p 
a J -^a 


Rq 


(03) 


where we have used that cq = 1, Aq = 1, and assumed that |Aa>o| < 1 so that A^^q ss 0 . 
This equation shows that p^^'^ approaches the stationary state Rq exponentially fast in 
the number of processes N. The only exception to this result is when the doubly stochastic 
matrix admits other eigenvalues with absolute value one. This situation, however, is not 
generic. It corresponds to systems that are not ergodic so that some portions of the 
configuration space cannot be accessed from others, see Appendix [D| 

Doubly stochastic matrices p are a special subgroup of Markov matrices which, besides 
satisfying the conservation of probability = Ij satisfy the additional constraint 

'^^Pn^rn = 1- This additional property allows one to prove that the right eigenvector 
corresponding to the Aq = 1 eigenvalue has the specific form R^^ = [l/D, - ■ ■ , 1/D)'^, 
where D is the dimension of the matrix p. Therefore, for a doubly stochastic matrix, 
the stationary state is the “infinite temperature state”. To this end, we simply check 
explicitly that = Rq^: 







/d-A 

pRt=P 


= 


= 



(i/d) 


1 ^ j 


1 ^ / 


(C4) 


this discussion, we ignore cases in which the Markov matrix cannot by diagonalized. The proof can be extended 
to these cases as well. 
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where the third equality follows directly from the doubly stochastic condition (91). Next, 
we prove that |Aq| < 1. Let us assume that there is an eigenvalue Ao larger than one. 
Then, the corresponding eigenvector grows exponentially under the repeated action 
of p, that is, Ra = This implies that there are entries of p^ that are larger 

than one. However, p^ is a product of doubly stochastic matrices so it is itself a doubly 


stochastic matrix (see Sec. 5.1.1). Therefore, all its entries need to be smaller than one. 


see Eq. (93). As a result, we conclude that there cannot be any eigenvalue larger than 


one. 


Appendix D. BirkhofF’s Theorem and Doubly Stochastic Evolution 

Birkhoff’s theorem |191j states that any doubly stochastic matrix is given by the convex 
sum of permutation matrices IIq,: 


p = ^A:anQ,, '^ka = l, 0 < fca < 1. 

a a 


We can then rewrite the doubly stochastic master equation ( 88 ) as 


p(') = ppW = 


^ ^ ka^aP 


( 0 ) 


(Dl) 


(D2) 


When only one permutation matrix contributes, the master equation simply describes a 
perfect transfer of population between two states. For example, for a three level system 
in which only one permutation matrix contributes, say, between states one and two, the 
permutation matrix is given by 


nio2 



(D3) 


and the master equation above simply reduces to: 


( 1 ) ( 0 ) ( 1 ) ( 0 ) ( 1 ) ( 0 ) 

dll = d 22 b P 22 = dil , d33 = d33 


(D4) 


If we apply the master equation with this transition matrix, the system will enter a cycle 
that will neither lead to an entropy increase nor to an energy increase. 

In general, however, many permutation matrices contribute to the master equation. 
Using Birkhoff’s theorem, it is easy to see that the most general doubly stochastic matrix 
for the three level system considered above is 

( 6 + /3 a + rj 7 + ^\ 

a + p (5 + 7 /3 + 7/j, (D5) 

7 + r/ + p 6 + a J 

which describes the transfer of probabilities between states 1 and 2 with weight a, be¬ 
tween states 2 and 3 with weight /?, between states 1 and 3 with weight 7 , no transfer 
of probabilities with weight 5, cyclic permutations 1 —?■ 2 —?■ 3 —?■ 1 with probability p, 
and cyclic permutations 3 —?■ 2 —?• 1 —?■ 3 with probability p. Note that the sum of the 

weights is one, that is, Q; + /3 + 7 + (5 + r? + /i = l, so that the matrix above is doubly 
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stochastic (i.e., the sum of each row and each column is one). Also note that, while p is 
not symmetric in general, it is symmetric when only pairwise permutation are present. 

The most general master equation for a three level system, using p in Eq. (D5), has 
the form: 


Pn = (<5 + + (a + r])pf^ + (7 + 

P 22 = (a + P)pn + (<^ + 7)4? + (/3 + P)p33 

P 33 = (7 + P)pn +W + p)p'S + (<5 + a)^? > 


and is already quite complicated when compared to the particular case in Eq. (D4). The 
complexity increases as the number of states increases. However, as we discussed in the 
main text, the doubly stochastic form of the transition matrix leads to several important 
general consequences (see also Ref. m)- 


Appendix E. Proof of (W) > 0 for Passive Density Matrices and 
Donbly-Stochastic Evolution 

In this appendix, we prove that {W) > 0 if the initial density matrix is passive and the 
evolution is doubly stochastic (see also Ref. |188j i. 

Let us arrange the energy levels in order of increasing energies and, hence, by passivity, 
decreasing occupation probabilities, that is, Ei < E 2 < ... and p® > P 22 ^ ■ P^dd- 

We also assume that the Hilbert space size D is hnite (one can always take the limit 
D —^ 00 at the end). The general expression for the average work is 


{W) = Y,p^^^Em-Y.P^^lEn 

m n 

— y ^ PnnPn^mEm — ^ ^ p^^^En = ^ ^ Pnn ^ ^ Pn^mEm — En 
m,n n n \_ m 


(El) 


where the sums over n and m go from 1 to D. Next, we define 

— Pnn PD-k,D-k- 


(E2) 


Clearly, from the passivity condition (121), = pnn — > 0 . Then, we rewrite 

Eq. (lEll) as: 


D 


(W) = 


n=l 


■ D 

E 

.m=l 


Pn- 


Erfi 




(E3) 


where, using the doubly stochastic condition for transition rates, that is, X]n=i Pn^m = 1, 
one can show that 




D 


n=l 


D 


^ ^ Pn^mEm E^ 


.m=l 


= 0 . 


(E4) 
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Finally, noting that = 0, we rewrite (E3) as 


D-l 


{w) = Y.aI 


n=l Lm=l 


D 


^ ^ Pn^mEm -E'n 


(E5) 


Next, we write in terms of A^, that is, A° = A^ + j^_i — P^Pd, and plug it 
in the equation above to obtain 


D-l 


{W) = Y.aI 


n=l Lm=l 


D 


^ ^ Pn^mEm E^ 


+ K\ 


(E6) 


where 




( 0 ) 


D-l 


n=l Lm=l 


D 


^ ^ Pn^mEm E^i 


> 0 . 


(E7) 


To see why > 0 note that: (i) the passivity condition, Eq. (121), implies £)_i > 


P^ddi that (ii) the remaining sum can be rewritten as 


D-l 

E 


n=l 


D 




.m=l 


D 


D 


D-l 


Pn^mEm E PD^mEm ^ ^ Ej^ 


m,n=l 


D 


D-l 


D 


m=l 

D 


n=l 


D 


^ ^ Em ^ ^ En ^ ^ PD^mEm — E£) ^ ^ pjy^mEm ^ Ejj ^ ^ P£)^mE£) — 0 , 

(E8) 


m=l 


n=l m=l 


m=l 


m=l 


where we have used the doubly stochastic (91) condition multiple times. Einally, noting 
that A1,_^ = 0, we rewrite Eq. (E6) as 




■ D 

E 

_m=l 


Pn- 


¥mEm Eyi 


(E9) 


Equation (E9) is similar to Eq. (E5) except that the external sum in Eq. (E9) extends 
only up to H — 2 and not to H — 1 as in Eq. (E5). 

The proof continues iteratively. For example, in the next iteration, we write A^ in 


terms of A^, that is, Aj, = A^ + p^^^^D-i “ pS-i,d-i 


to obtain 


D-2 


(IT) > A 


n=l 


■ D 

E 

_m=l 


Pn^mEfii — En 


+ K^ 


(ElO) 


where 


— \PD-2,D-2 PD-1,D-1) Z_^ 


(0) 


D-2 


n=l 


D 


^ ^ Pn^mEm En 


.m=l 


(Ell) 
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The prefactor {p^d_2 d-2 ~ P^D-i D-i) positive by the passivity condition, Eq. (121). 
Moreover, using similar steps as above, we rewrite the sum as 


.( 0 ) 


D-2 

■ D 





D 


E 

^ ^ Pn^mEm 

~ Eji 

= Ed 

+ 

Ed-1 

^ ^ P{D—l)^m 

n=l 

_m=l 





m=l 









D 




= Ed 

+ 

Ed-1 

1 

to 

b^ 

-. + E 








m=l 




> Ed 

+ 

Ed-1 

— “^Ed- 

-1 + [P{D 




IV 

tq 

b 

+ 

Ed-1 

— “^Ed- 

-1 + {Ed- 


D 


m=l 


(E12) 


where we have defined 5Em = E^-i — Em- In the third line, we have used that 5Em > 0 
for any m < D — 1 and, in the fourth line, we have used that < 1, 

which is guaranteed by the doubly stochastic condition Yl,mP'm^D = 1- We therefore 
conclude that K 2 > 0. Finally, noting that ^‘jy _2 = 0 we arrive at: 


D -3 

{W) 

n=l 


■ D 

E 

,m=l 


Pn- 




(E13) 


where now the external sum extends only up to H — 3. 

Clearly, comparing Eqs. (E5), (E9) and (E13), we see that at each iteration the upper 
limit of the external sum decreases by one and the index k in 


increases 


by 


one. 


Continuing this iterative process, eventually, the external sum will include only one 
element proportional to which is zero by definition [see Eq. (E2)]. Therefore one 

can conclude that (W) > 0. 


Appendix F. Derivation of the Drift Diffusion Relation for Continuous 
Processes 


We already showed how one can derive the drift diffusion relation (177) by means of a 
cumulant expansion of the Evans-Searles fluctuation relation (157). Here, we show how 
the same result can be derived directly from the ETH ansatz (62) applied to continuous 
driving protocols. In particular, let us focus on a setup in which the external parameter 
A, conjugate to the observable O = —d\H, changes in time at a constant rate. For 
example, this parameter can be the position of a macroscopic object moving in some 
media (quantum or classical). 

Within leading order in adiabatic perturbation theory (see Ref. m for further de¬ 
tails), the energy dissipation in the system is given by (we have set fi = 1 so that the 
energy has dimension of time“^): 


^ pa ^ ■^^—^{n\d\m){m\d\n)d{En - Em), (FI) 

n,m 

where pn and pm are the stationary probabilities to occupy the many-body eigenstates 
|n) and |m) corresponding to the energies and Em, respectively. All matrix elements 
and energies here correspond to the instantaneous value of A. For the Gibbs distribution. 
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Pn oc exp[—l3En], it is easy to see that 


pn Pm 
Em Erfi 




Em) — Ppn^iEn 


Em) 


(F2) 


and Eq. (FI) reduces to the standard expression for the energy dissipation (see, e.g., 
Ref. |317j ). As in many places in this review, let us focus instead on the dissipation from 
a single many-body energy eigenstate, pn = <5n,no- If the relation holds for any eigenstate, 
it holds for any stationary distribution with subextensive energy fluctuations. For a single 
eigenstate, Eq. (El) becomes 


dQ 

dt 


2A' 


’ E —^— 

^ E — E 

my^no 


{no\d\m){m\0\no)6{Eno - E„ 


(F3) 


Let us now use the ETH ansatz (62) and, as usual, replace the summation over the 
eigenstates by an integration over uj = Em — E^g = Em — E: Ylm f dco exp[5(E + cj)] 
(for simplicity, we drop the index no in the energy). Then 


^ = 2 X‘^V f duj 
dt I 


^S{E+uj)-S{E+uj/2) 


UJ 


-\fo{E + uj/2,uj)\^5{uj), 


(F4) 


where V f stands for the principal value of the integral. Noting that 


^S(E+i^)-S(E+u;/2) _ ^Puj/2 ^ j ^ ^ 

lfo(E + CJ/2,UJ)I^ = lfo(E,cj)l^ + '^dj,lfo(E,uj)l^ + 


(E5) 


and using the fact that |/o(E,a;)p is an even function of uj, we find 


-^^Je = A2[/3|/o(E,0)|2 + 5e|/o(E,0)|2]. (F6) 

We note that, formally, the function \fo{E, 0)p diverges in the thermodynamic limit (see 
Fig.[T3). However, physically, this divergence is cutoff by the inverse relaxation time in 
the system, which defines the broadening of the (5-function in Eq. ( |F1[ ). 

Similarly, from adiabatic perturbation theory, one can show that 

= De = 2A2 (no|d|m)(m|d|no)(5(E„„ - Em) = 2X^\fo{E,0)\\ (E7) 

m^UQ 


Comparing Eqs. (E6) and (E7), we recover the desired drift-diffusion relation (177). 


Appendix G. Derivation of Onsager Relations 


In this appendix, we derive Eqs. (179) and (200). Since Eq. (200) is more general, we 
show its derivation first and then obtain Eq. (179) as a special case. First, in the Crooks 
relation (200), we expand the entropy and the probability distribution as a function of 
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El,II ± W and Vj II zb 5N to second order in W and 6N to obtain 


P(Ei,^ii,iVi,Vii,W,,5iV)exp 


-A/3W - Ak6N 


-^dEA/3 


6 N^ 

~Y~ 


BnAk 


(Gl) 


W6N 

2 


(Sat A/3 + OeSk) 


exp [WdE + <5V5jv] P{Ei, En, Ni, Nn,-W, -6N), 


where A/3 = /3i —/3ii, Ak = ki — ku, /3i = ^j5i, ki = and similarly for II (I —)• II). 

The partial derivatives Oe and are understood as derivatives with respect to energy 
and particle exchange: dEf{Ei,Eii) = dE^fiEi, Eu) — dE^fiEi, Eu). Next, one needs to 
integrate over W and 6N and perform the cumulant expansion to second order. Following 
the discussion after Eq. (173), we keep only the terms linear in the cumulants to obtain 


- AP{W) - Ak{6N) - i^9^A/3 - ^-^^OnAk - (OnA/3 + OeAk) 

+ + APAk{WSN)c 

= —dE{W) — On/SN) + -9|;e(^^)c + 2^‘nn{^E^‘^)c + 6N)c ■ 

(G2) 


Using the following identities: 

{W^)cdEAf3 =dE (A/3(IU2),) - ApdE{W‘^)c, 
{6N‘^)cdNAK =dN [Ak{6N‘^)c) — AKdN{SN‘^)c, 
{W5N)c {BnAP + OeAk) =dN{A/3{WSN)c) - A/3dN{W5N)c 

+ dE{AK{W6N)c) - AKdE{W6N)c, 


we can rewrite Eq. (G2) as 


(A/3 - dE) 
{Ak - On) 


-{W) + + ^{W5N), + ^dE{W^)c + ^d^{W5N), 

-{SN) + ^{6N^)c + ^{WSN), + ^dN{SN^)c + ^9b(IU,5V) 


+ 


c 


(G3) 

= 0 . 


Since this relation holds for any value of A/3 and Ak, each term in the square brackets 
must be zero, leading to Eq. (200). By assuming that the systems do not exchange 
particles, that is, that 6N = 0, we recover Eq. (179). 
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